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1 Chapter 1: Introduction 



While both quantum mechanics and gravity have been repeatedly tested and 
verified, combining the two into a single theory has not been done to this 
day. Due to the fact that the strength of the gravitational interaction between 
elementary particles is many magnitudes smaller than that of any other in- 
teraction, there are no experiments that detect gravity at the quantum scale. 
At the same time, once we are dealing with objects heavy enough for their 
gravity to become detectable, they are too large for any quantum effects to 
be observed. Thus, there is no scale at which both quantum mechanics and 
gravity are significant. This leaves no experimental data from which to start 
in building a quantum theory of gravity. 

Ironically, despite the fact that the gravitational field is very weak, the 
fact that general relativity identifies gravity with geometry makes the gravi- 
tational field more important than other fields, since the latter need geometry 
to propagate. Thus, even though we do not need to combine quantum field 
theory with gravity in order to explain outcomes of experiments performed so 
far, a theory combining the two is important in order to answer a number of 
conceptual questions. In particular, if gravity did not exist, the "fuzziness" of 
quantum mechanics, albeit counterintuitive, would still have been well defined 
since "fuzzy" particles would have existed in non-fuzzy spacetime. But due 
to the quantization of gravity, spacetime becomes fuzzy as well, leaving no 
non-fuzzy background. 
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String theory provides one way of answering this question. The string 
spectrum generates all the known particles, including gravitons. The latter 
replace spacetime curvature in defining gravity. However, string theory has 
made a number of predictions that, due to their very small scale, are not 
verifiable in the lab to this day. The possibility of both positive and negative 
outcomes makes it both a prime pro as well as prime con when it comes to 
pursuing the research in this field. 

The main alternative to string theory is spacetime discretization. This 
approach takes the traditional definitions of quantum field theory and grav- 
ity and attempts to combine them directly. Even in regular non-gravitational 
quantum field theory, in order to make rigorous sense of path integrals we have 
to introduce a lattice (see Zee's book on quantum field theory [1]). Coordi- 
nates of a point will then be defined in terms of a position of that point in 
a lattice, which conceptually means that the bond structure is fundamental, 
while coordinates are only a bi-product. 

Strictly speaking, coordinates simply designate a label, there is nothing 
new in saying that they come secondary to actual geometry. However, in 
practice they almost always correspond to manifold structure. Thus, the idea 
of making them secondary invites one to think that manifold structure comes 
second to something more basic, which ultimately invites us to abandon the 
manifold structure. 

Strictly speaking, a lattice is not a manifold to start with, since the former 
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is discrete and the latter is continuous. But intuitively it can still be viewed 
as manifold-like. Its manifoldlike appearance is a consequence of the specifics 
of bond structure. Since one can think of other kinds of bond structures that 
don't resemble a manifold, one is hard pressed to invent a specific Lagrangian 
that would "ahgn" the "molecules" of spacetime into the manifold. In order to 
be able to do that, the Lagrangian has to be well defined for all structures, not 
necessarily manifold-like. This implies that manifold-ness is not fundamental. 

In this framework, the difference between curved and fiat spacetime might 
be simply the difference in that bond structure. Thus, the presence of gravity 
means that the "solid" spacetime is allowed to "bend" . However, in the quan- 
tum context, this means introducing extra degrees of freedom. These extra 
degrees of freedom enable spacetime not only to "bend" but also to "melt" into 
"liquid" . In the latter case, instead of being "curved" it would loose manifold 
structure altogether. 

This is consistent with the uncertainty principle: at small scales, the un- 
certainty of the gravitational field should increase to infinity. On the other 
hand, that same uncertainty principle allows the gravitational field to be less 
uncertain at larger scales, which would explain why the observed universe is 
manifold-like. However, it is not clear just how can this happen: how can 
very non-manifoldlike building blocks line together into a manifold? Engi- 
neering a spacetime that meets the above description is the ultimate challenge 
of quantum gravity. 
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In most such approaches to quantum gravity, the spacetime is built ac- 
cording to some rules. The dynamical triangulation model is one example of 
such approaches, in which one unwanted side effect is the violation of relativ- 
ity due to the fact that simplices, or other such structures, are not Lorentz 
symmetric. Causal set theory, on the other hand, avoids all such violations 
of relativity by avoiding postulating such structures from the start. In other 
words, while most discrete approaches to quantum gravity replace a "solid" 
with a "hquid" , causal set theory replaces it with a "gas" . The goal of causal 
set theory is to develop a non-manifold generalization of a Poisson distribu- 
tion of points, since this is the only discrete structure that is relativistically 
invariant. 

While causal set theory still uses links between the points, these hnks are 
interpreted as light-cone causal relations as opposed to spatial bonds. In other 
words, classically, signals can travel only along these links, in the directions 
specified on each link. These links are referred to as "causal relations". The 
fact that causal relations are the only fundamental structure, corresponds to 
the principle of relativity that the speed of light is invariant. In order for 
this to resemble a Poisson distribution, the causal relations are determined at 
random: each point can have an arbitrary number of links, and the presence or 
absence of causal relations between any of the pairs of points is independent of 
the presence or absence of causal relations between any other pairs of points. 

This approach is motivated by an observation made by Hawking that if 
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we have a Lorentzian manifold, then the metric can be completely determined 
based on Weyl scaling and light cones (causal relations) alone. Here, "Weyl 
scahng" refers to the information about the volumes of the regions of space- 
time, and "causal relations" refer to an information as to whether or not one 
can travel from one given point to another without going faster than the speed 
of light. 

In the discrete case, Weyl scaling is defined by a simple count of points, 
since each point is assumed to take up exactly the same volume. Thus, causal 
relations alone are now defining features of a metric. This idea is beautiful 
quite independently of an issue of quantum gravity. After all, causal relations 
have a much more basic physical significance than a metric tensor, and in the 
discrete case the result shows that spacetime geometry can be described in a 
purely combinatorial way. 

However, there is a gap in the above argument. Namely, Hawking's ob- 
servation only applies to the situations where we already know we are dealing 
with manifold, we just don't know the metric. According to the proposal of 
this dissertation, the issue can be addressed by replacing the stochastic process 
with a Lagrangian-based dynamics as a determining factor of causal relations. 
This is motivated by a well established fact of general relativity, that geometry 
and gravitational field are the same thing. In light of the fact that we now 
identify geometry with causal relations, the natural conclusion is that causal 
relations are, too, identified with gravitational field. As such, they can be 
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thought of as subject to various Lagrangians, which may mean that applying 
a pure stochastic process is not the way to go. 

Designing such Lagrangians is the main goal of this dissertation. Its con- 
tents are divided into three parts. In the first part, Lagrangians are defined 
for gravity and various types of matter fields. In the second part, a model of 
quantum "collapse" is designed. One of the applications of that model is a 
way to "collapse" the causal relations into some background topology which 
is needed in order to be able to go from Lagrangian to propagator. Finally, 
we have to argue that after causal relations are subjected to such "collapse" 
they would collapse into manifold-like ones with very high probability. The 
first step was done in body of the thesis, while the second and third parts, due 
to their controversial nature, are done in the Appendix. 
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2 Chapter 2: Structure of a Causal Set 



2.1 Introduction 

The purpose of this chapter is to discuss the basic geometry of a causal set. 
In Section 2.2 distances and volumes will be defined on a causal set. For time 
being, it will be assumed that the causal set is given from the start; questions 
such as where causal set came from and why it has the structure that it does 
are left for Sections 2.5-2.7. In these latter sections, two alternative models 
of the origin of causal structure will be presented. One is dynamical, where 
events are being added one by one and causal relations are decided in the 
process; this is called "classical sequential growth dynamics" , and the best- 
studied model of this type is known as "transitive percolation" (Ref [IE])- The 
other approach is static, where all possible versions of a "completed" causal 
set will be considered. That is, it will be assumed that all events and their 
causal relations are there from the start (perhaps each possibility existing in 
its own parallel universe) and probabilities are assigned to them. 

In Section 2.5 it will be shown that the transitive percolation model leads 
to unwanted "big bangs" (which are referred to as "posts"). This is blamed on 
the fact that the transitive percolation model is purely statistical and does not 
incorporate any Lagrangians. For that reason, in Section 2.7 an alternative 
will be presented: A Lagrangian-determined structure for a causal set. For 
that reason, the notion of stochastic process will be abandoned as redundant. 
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However, in Section 2.7 it will be assumed assumed that geometry, de- 
termined by the causal structure, is a quantum field (namely gravity). The 
fiuctuating geometry collapses into a fixed one. This will require an interpre- 
tation of quantum mechanics, specifically a theory of quantum measurement. 
The proposed model of quantum mechanics will be a theory of "quantum cor- 
ridors" (Chapter 5 of Ref [H]). According to that model, a quantum system, 
including causal relations, is being continuously measured while the measure- 
ment is not precise; the measurement error allows for quantum fiuctuations 
and determines a range of path integral. 

Qualitative arguments regarding the compatibility of the above model 
with decoherence theory are left for the Appendix. While this will no longer 
pertain to causal sets, this part is vital in terms of justifying the causal set 
construction of Section 2.7, which is the only proposed model of topology of a 
causal set as far as this dissertation is concerned. 

2.2 Volumes and Distances on a Causal Set 

Due to the discreteness of causal sets, volumes are straightforward to define. In 
general, the definitions for a general causal set are motivated by observations 
of Poisson distribution of points in Lorentzian manifold. In the latter case, 
the average volume taken up by a single point is some fixed number, vq. The 
variation of that value is due to the fiuctuations of the density of scattering. 
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In case of causal set, however, the points are no longer assumed to be 
embedded in a manifold; thus, it no longer makes sense to speak of variation 
of their density. Therefore, it is assumed that vq is an exact value of a volume 
taken up by every single point, rather than the average. Typically, this is 
assumed to be a volume based on the Planck scale, although there is no need 
to make this specific assumption. This implies 

V{T)^vo^T, (2.1) 

where T is a subset of S and jjT stands for number of elements of T. Since 
there is no continuum, there is no well-defined notion of a "region" . Thus, the 
definition of a volume applies to any T G S. 

A definition of distance, on the other hand, requires the notion of sequences 
of points that are discrete analogues of geodesies. The number of points on the 
segments of these curves defines their length. Consider a flat Minkowski space 
and two timelike-separated points in that space, and rotate the coordinate 
system in such a way that the two points of interest are lying on i-axis, with 
coordinates ti and ^2- Let 7 be an arbitrary future-directed curve that connects 
them. The length of 7 is given by 

/(7)= / i/(dt)^^^^(d^ < / |dt|= / dt = h-h = r{p,q). (2.2) 

The second equal sign in the above equation is based on the assumption that 
the curve is future-directed. Thus, while it is not true that the length of every 
single curve that connects p and q is less than the Lorentzian distance between 
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them, the statement is true for future-directed curves. This, for example, rules 
out the scenario involving traveling to the future or back either arbitrary far 
or arbitrary many times, resulting in a curve of arbitrarily large length. 

The above statement can be easily adjusted to the discrete case of causal 
sets (see Refs [H] and [I5]). A future-directed curve can be viewed as a future- 
directed set of points, p ^ ri ^ ... ^ r„ -< g. Selecting the longest possible 
curve corresponds to selecting a chain of points of maximal cardinality. This 
has a side benefit: maximizing the number of points excludes the possibility 
of removing segments of a curve and/or skipping points at random. Thus, the 
points are spaced as densely as the discretization allows, making sure that the 
chain of points approximates a continuous curve. 

Thus, the definition of a distance on a causal set can be summarized as 
follows: 

t{p, q) = ma.x{n \ 3ri, r„ : p -< ri ^ ... -< r„ ^ g} . (2.3) 

Numerical studies (see Refs [H] and |15j) confirmed that in the case of a 
Poisson distribution of points on a Lorentzian manifold there is, in fact, a 
close correlation between the Lorentzian distance defined in the usual way 
and the one defined in terms of chains of points as above. The coefficient of 
proportionality, however, is still unknown. 

There was recent work done in [T7j where they tried to define distances 
between two space-like events. This was done under the assumption that pairs 
of points in interest are close enough to each other that spacetime in that 
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region looks flat. 

First let us describe an approach that [T7j discarded and explain their 
reason, and then, afterwords, describe the approach that they later advocated. 
Suppose p and q are spacelike-separated events separated a distance a apart. 
Choose the coordinate system so that they both lie on z-axis and origin is in 
the middle. Thus, p = (—a/2, 0, 0, 0) and q = (cr/2, 0, 0, 0). Suppose there are 
points r and s satisfying r -< p -< s and r -< q -< s. 

The fact that r -< p means that 

- (r. + - rj - > , (2.4) 
and the fact that r -< q means that 

r^-(r.,-^f-rl-rl>0. (2.5) 
Adding the two together gives 

< 2r^-2rl~2rl-ir,-^)' + r^-ir, + ^y = 2ir^~rl~rl~rl)-^ . (2.6) 
This implies 

sjrt -rl-rl-rl > ^ , (2.7) 

which means that the distance between r and the origin is greater or equal to 
I In a similar way, the distance between point s and the origin is also greater 
or equal to |. 

But in the discussion of timelike distances it was shown that the distance 
between two causally related events is equal to the length of the longest possible 
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curve between them. Thus, 

r(r, s) > r(r, 0) + r(0, s)>^ + ^ = t. (2.8) 

At the same time, there is at least one choice of (r, s) satisfying r(r, s) = r, 
namely, it is 

r = (-^,0,0,0); s = (^,0,0,0). (2.9) 

This proves that 

r(p, q) = inf{r(r, s) \ r^p^sAr^q^s}. (2-10) 

However, even though this is true in ideal flat Minkowski space, it is no 
longer true in a discrete case. As a result of the fact that one can perform a 
Lorentz boost in the yz-plane, there are infinitely many choices of r and s, all 
of which are separated by the distance r. In the discrete result of 

random fluctuations, some are separated slightly further away from each other 
than r, and some are slightly closer to each other than r. 

By definition, infimum selects the ones that are slightly closer. But, in 
light of the fact that there are infinitely many of them, there will be few that 
are A LOT closer, much like if one is to throw a dice infinitely many times, 
one might get 100 heads in a row at some point, with absolute certainty. Now, 
even one pair of points that is a lot closer than r is sufficient for infimum to 
be a lot smaller as well and thus not a good estimation. 

For that reason, in [17] they proposed a different definition: they consider 
all possible points s that are linked to p and g by a direct link. Here, when 
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a ^ b are related by "direct link" it is meant that there is no point c satisfying 
a -< c -< b. Thus, 

r{p,s)^r{q,s)^0, (2.11) 

which implies that s is either (t/2, 0,0,0) or a Lorentz boost of that in xy 
plane. For each such s, they find a point r{s) such that r(s) -< p and r(s) -< q 
which minimizes r(r, s) with respect to that constraint. Then, they average 
T(r(s), s) over all possible s. Since they average as opposed to minimize, they 
count both the cases when the distance is slightly lower than needed and the 
cases where the distance is shghtly larger than needed, which averages out to 
a correct distance. 

2.3 Vector Fields on a Causal Set 

Since in the causal set framework the spacetime is no longer viewed as a 
manifold, a tangent bundle is not well defined. This is also seen from the fact 
that Lorentzian indices are needed in order to specify value of gauge field. 
Therefore, the definition of gauge field needs to be replaced with something 
else. The latter should satisfy two properties: 

1) It is well defined for the case of a manifold, and in the latter case it has 
a known correspondence to vector fields 

2) It continues to be well defined if a given set is not a manifold. 

First, consider a simple case where any pair of points p E A4 and q e 
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Ai are connected by a unique geodesic. In this case, there is a one to one 
correspondence between functions V : M. ^ TA4 and v: A4 x J\4 ^ M., as 
long as both sets of functions are restricted to infinitely differentiable ones: 



where 7(p, q) stands for the geodesic segment connecting the two points. 

Since V: A4 —>■ TM. and v: M. x Ai —>■ M are in one to one corre- 
spondence, it is possible that the key player is f : At x At — > IR rather than 
V: M. ^ TM. and we were fooled by the one to one correspondence thinking 
that \t \sV : M TM. If such is the case, v: M x M ^ ^ can be gen- 
eralized from the manifold to a causal set in a straightforward manner since, 
being a real valued function, it makes no reference to Lorentzian indices or 
any other indicator of manifold structure. 

For the case where there is more than one geodesic that connects some 
pairs of points, things are no longer as simple. Consider, for example, a cylin- 
der Mx [0, 1] where and 1 are identified, and distances are defined in Euclidian 
sense. Suppose a vector field V is defined as follows: 



dv{p, q) 
dq" 



(2.12) 



and 




(2.13) 



1) V, 



for y < 1/4 or y > 3/4 



2) 



X for 1/4 < y < 3/4 



3)n 



everywhere. 
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There is more than one way of connecting pairs of points via a geodesic. 
To remove the ambiguity, estabhsh a rule that for any pair of points p and q 
the geodesic is selected whose length is the closest to 1. This means that if 
p and q are very close to each other, the geodesic will be selected that circles 
the cylinder once. It is easy to see that in this case 

v{p,q) ^^{qx-Px) , (2.14) 

implying that 

^^(P,9)U=^7^V;. (2.15) 

One way to establish a one-to-one correspondence while avoiding that 
difficulty is to make the following criteria: 

1) Suppose that there exist at least one disc D such that 

a) p and q arc both elements of D 

b) If r and s are any two elements of D then there is only one way they 
can be connected by a geodesic segment, 7(r, s) in such a way that it does not 
escape the interior of D, that is, r, s e ■j{r, s) C D. 

In this case, ■yip, q) is used for the definition of v{p, q) 

2) Suppose p and q are so far away from each other that the above- 
mentioned D does not exist. In this case, v(p, q) ~Q 

Thus, the expression of v as an integral of V is true only locally; if the 
points are spaced far enough from each other that one has to make a choice of 
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what path to take, none of these paths would give v. On the other hand, the 
expression of as a derivative of v is always true since the latter, by definition, 
is local. 

This means that v is more fundamental than V since the latter is expressed 
in terms of the former and not the other way around. This confirms that the 
vector field is really v and we were mistaken in thinking it was V, which would 
allow the generalization of vector field on non-manifoldlike situations. 

For the case of Minkowski space, the disc can be replaced by an Alexandrov 
set, which sets the following correspondence: 

DEFINITION: Let U be an open subset of a manifold /A. U is said to 
be UNIQUELY CONNECTED if for any pair of points r e W and s e W 
there exist a unique geodesic segment 7(p, q) which is contained in U. In 
other words, there exist a unique geodesic segment 7 satisfying {p, q} C 
7([7~^(p), 7~''-(s)]) C U . Such 7 is denoted as ^u{P:Q) 

DEFINITION: Let r and s be two elements of M. They are said to be 
UNIQUELY CONNECTED if the following is true: 

1) There exist at least one uniquely connected Alexandrov set a{p,q) 
which contains both r and s 

2) If {r, s} C a{pi, qi) n a{p2, 52) then -faipha{pi,qi){r, s) = 7a(p2,g2)('^> 

The above common value of "ya(p,q) {t^, s) is denoted by 7(r, s) 

DEFINITION: Let Al be a Lorentzian manifold, and letV: M ^ TM 
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be a vector field. Then : x ^ M is a function defined as follows 
1) If p and q are uniquely connected, then 



2) If p and q are NOT uniquely connected, then v{p, q) = 0. 

As was mentioned earlier, the fact that V is always a derivative of v but 
V is not always an integral of V implies that v is more fundamental than V, 
which is what is desired since the former does not appeal to manifold structure 
while the latter does. 

This means that, for example, it is possible to set v{p, q) — 1 instead of 
q) — ioT points that are not uniquely connected, and this will amount 
to a physical field having different values. However, despite the fact that the 
value of the field will, in fact, be different, the value of Lagrangian density 
will still be the same. In other words, Lagrangian density is symmetric under 
the variation of the value of the field for pairs of points that are not uniquely 
connected. 

In performing the path integral, all possible two point functions v: M. x 
— i> M will be taken into account, not just the ones that can be derived from 
V. After all, in the general case there is no such thing as V, so it should not 
factor into the definiton of path integral. This means that different v-s that 
are related to each other by symmetry will be added to each other, resulting 
in overcounting. 




(2.16) 
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However, Fadeev-Popov ghosts are NOT introduced to deal with that 
overcounting. Instead, the infinity is avoided by the fact that the range of 
integration is hmitted to a fixed tunnel (see sec 2.6 and 5.4). This tunnel 
corresponds to the "collapse" of the quantum system into a roughtly defined 
trajectory up to smallest classical scale. Since all fields, including the vector 
field, are "collapsed" to the range defined by classical trajectory, overcounting 
will not produce an infinite result. 

In terms of more rigorous math, a generalized notion of a vector field is 
defined to be a differentiable function g: AixM.. In the case of a differentiable 
manifold, to every differentiable function g: A4xA4 ^ M, corresponds a vector 
field Vg-. M X ^ R such that, for every p e M and / G J^, 



and, in case that any pair of points are connected by unique geodesic, to 
any vector field v defined in a usual way for a manifold, the corresponding 
holonomy : A1 x Al — > M is given by 



(2.17) 




(2.18) 



where d7 is a vector defined as 



d7(/) 



d(7(/(t)) 



(2.19) 



dt 
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2.4 Lack of Symmetry and Definition of Vectors 

In the last section it was stated that due to the lack of manifold structure, 
the notion of tangent vector can not be used, which is the ultimate reason 
why an alternative definition of vector, as a two-point function, is introduced. 
This raises a question: what about a square lattice? Strictly speaking it is not 
a manifold, yet one can easilly define Lorentzian indices corresponding to its 
discrete coordinates. 

At first one might say that it is not the issue of discreteness by itself, but 
rather the issue of lack of coordinates that forced us to abandon the standard 
definition of a vector. However this is not quite true either. Strictly speaking, 
it might still be possible to "label" different elements of a causal set, and adjust 
these labels in a way that the causal relations correspond to what the labels 
would tell us - or, if that proves to be too difficult, adjusting the metric tensor 
link-wise might be another avenue. On the other hand, in the case of a regular 
manifold, a lot of differential geometry books define vectors in a coordinate- 
independent way in order to emphasize their pure geometric nature. Thus, it 
is important to look at the real reason, beyond the presence or lack of labels, 
as to why in the case of causal sets a new definition is needed. 

In the standard theory of manifolds, a vector at a point is defined as a 
hnear operator on a set of differentiable functions. That is, if Al is a manifold 
and ^ is a set of differentiable functions — > M, then a vector field is a 
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function v: x ^ — > M such that for any point p E Ai and any two functions 

/ and g, and any real numbers ki and /c2, 

v{p, kj + k2g) = k^v{p, f) + k2v{p, g) . (2.20) 

and 

^{p, fg) = f{p)v{p, g) + g{p)v{p, f) (2.21) 

Intuitively, a vector v^: M. ^ TA4 corresponds to the differential operator. 

v{pj)^v^{p)d,f\,. (2.22) 

On the other hand, a coordinate system can be thought of as a set of d func- 
tions, : /A ^ M.. Here, = 0, d—1 are simply the names of the functions 
and are not to be confused with coordinates. At every point p e R, v^^{p) can 
be defined as 

v''{p) = v{p,x'') . (2.23) 

The reason that the above definition of vector field, despite being coordinate- 
independent, does not work in general is that it has too few degrees of freedom. 
When Lorentz index is used, one, strictly speaking, looks only at the deriva- 
tives along the coordinate axes, and nowhere else. Now consider the following 
example. Suppose a function / is defined as follows: 

1) /(t,0,0,0) = i; 

2) f{t, X, y, z) — 0, if any of x, y or z are non-zero; 
In this case, d^(j)dfj,(j) — 1. 
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On the other hand, suppose that instead of singhng out the t axis, we 
single out thet' axis, which is tilted relatively to the t axis. In other words, 

1) g{t, X, 0, 0) = t, if X = kt; 2) g{t, x, y,z) = if either x is NOT equal to 
kt OR if either of y or 2; is non-zero. 

In this case, 9^0 = 0. 

This reveals that 9^09^0 is NOT truly Lorentz covariant. The reason we 
don't run into this issue is that in case of differentiable functions we HAPPEN 
to get the same result after performing the rotation. This, of course, is a 
definition of symmetry. 

Thus, the moral of the story is that the fact that the expression involving 
Lorentz indices makes sense is a consequence of symmetry. Once the symmetry 
no longer exists, as in examples presented above, the contraction of Loretnz 
indices is no longer physical since it begins to depend on our choice of labels. 

This applies not only to derivatives but to anything that has a Lorentzian 
index. When a vector is denoted by V^^ what is really meant is that once FOUR 
aspects of that vector are specified, namely its projections on the four axes, 
ALL other aspects are specified as well - namely the projection on any other 
axis is a linear combination of the projections on the four given axes. This 
is ultimately why refers to only FOUR components. Now, that relation 
between projections is called symmetry, thus once there is no more symmetry 
this is no longer true. 
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In case of a causal set, the symmetry is lacking due to the fact that different 
locations in a causal set can not be "perfectly matched" on each other. Thus, 
based on what we have just learned, due to lack of symmetry the vector-based 
expressions are meaningless. 

True, a regular lattice does not obey the rotational symmetry either. In 
fact, that is the main reason why it is not an acceptable model. However, due 
to the fact that it does possess translational symmetry, there is some rough 
correspondence between the two on the scale of large numbers of points, which 
allows us to recover the notion of vector field at that scale. The definition of 
derivative in terms of neighboring points is nothing but an extrapolation of 
what is expected to happen for pairs of points that are close on the scale in 
which we live in, but are still separated by many points and thus are not true 
"neighbors". Its definition on the few-point scale is simply an extrapolation 
of the latter. 

On the smaller scale, on the other hand, the regular lattice structure is 
our enemy rather than our friend, since it is not symmetric under rotation 
or Lorentz boosts. True, random Poisson distribution is not symmetric un- 
der these transformations either. But while Poisson distribution violates these 
symmetries stochastically, the regular lattice structure violates them in a con- 
sistant matter, which makes the latter worse. On the language of symmetries, 
this means that translational and rotational symmetries are not compatible 
with one another for discrete structures. Poisson distribution, by violating 
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both on a small scale, allows to maintain both on larger scale without putting 
one above another. On the other hand, regular lattice, by never violating 
translational symmetry, violates rotational symmetry more consistently. 

This raises a question: since there are no longer rotational symmetries, 
how is Lorentz covariance defined, especially since causal set theory is ad- 
vertized as the one that is loyal to the latter? The answer is that Lorentz 
covariance is now defined as a statement that light cone causal relations are 
the only defining features of geometry. Inside the Lorentzian manifold, this 
is equivalent to saying that speed of light is covariant which is fundamental 
tenat of relativity. At the same time, once phrased in a language of partial 
order, this statement continues to be well defined for a general causal set. 

In the language of symmetries, one can say that while there is no longer 
active Lorentzian symmetry, there is still a passive one. Passive symmetry is 
defined in terms of labeling of points. While there is no reason to do that, 
nor is it done in the theory, one can, if they want, label points with local 
coordinates as well as choose the values of metric 2-tensor in such a way that 
causal relations derived from these will match the aforegiven causal structure 
of the set. Since this labeling is not implimented in the theory, the theory is 
covariant with respect to the choices of such labelings, which means that it 
possesses passive symmetry. 

The core of causal set theory is to abandon such labels altogetgher as 
opposed to claiming the covariance with respect to relabelings. However, this 
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would not have been possible if there were no passive symmetry. Thus, lack 
of labels might be a new, causal set based, version of the definition of passive 
symmetry. In this language, it is easy to see that as long as no geometric struc- 
tures besides causal relations are being used, any and every theory possesses 
passive symmetry by default. 

That symmetry, however, is neither rotation nor translation; but since 
its manifold analogue is Lorentz symmetry it is still defined as such. For the 
latter reason, while the rotational covariance is taken care of passively, the 
translational one is not, which means that making causal set manifold- like 
remains to be one of the most serious problems of the theory. 



2.5 Causal Set Growth by Transitive Percolation 

Before moving on to the novel way of doing dymamics on a causal set, let's get 
some taste of the more standard, stochastic-based dynamics, and prove one of 
its basic results, that there will be a chain of infinitely many "big bangs" or 
"posts" . 

Consider a process of steady growth of the causal set. Initially, the entire 
causal set consists of only one point, point "1". Then another point, point 
"2" , is added. Then a random decision is made between two options: point 2 
is either causally after point 1 or the two points are unrelated (here, the term 
"unrelated" means that neither point is either before or after the other, which 
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is equivalent to a statement that the two points are spacehke-separated) . The 
former option is selected with probability p and the latter with probability 
1 — p. After that yet another point, point "3", is added. Again, 3 :^ 2 is set 
with probability p, or 3 and 2 are left causally bun-related with probability 
1 —p. If it happens that 1^2 and 2^3 then 1 ^ 3 is enforced by transitivity. 
Otherwise, the causal relation between 1 and 3 is selected by the same random 
process: 1 -< 3 with probability p and they are un-related with probability 
1 — p. Then point 4 is added and its causal relations to the existing points 
are determined by the same random procedure, and so on. Whenever point n 
is added, all points from i — n — Itoi — 1 are viewed in decreasing order. 
For each i < n, the question is asked whether or not there is any point j for 
which it is already established that i ~< j and j -< n. If so, i -< n is enforced. 
Otherwise, the relation i ~< n is established with probability p and while i and 
n are left unrelated with probability 1 — p. After repeating that procedure 
starting from i — n — 1 and finishing with i — 1, n is increased by 1 and the 
same procedure is repeated. 

A point is said to be a "post" if it is causally related to every single other 
point in a causal set. Consider, for example, a situation right after the second 
point is added, i.e., when entire causal set consists only of two points. If these 
two points are causally related, then they are both posts. Point 1 is a post 
because, by being "before" point 2 it is formally "before every single point in 
the set" , and point 2 is a post because, by being "after" point 1 it is formally 
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"after every single point on a set. On the other hand, if these two points are 
causally un-related, then neither of them are posts. 

In general, if a causal set is homeomorphic to subset of N, then every 
single element of that set is a post. On the other hand, if it is homeomorphic 
to the region of a square lattice, with causal relations 

(mi, rii) -< (777.2, 4^ (7772 - 712)^ - (7?7i - 7ii)^ < , (2.24) 

then whether or not a given point is a post depends on what subset it is 
homeomorphic to. For example, if it is homeomorphic to {(777, 7i)}|7?7| < \n\, 
then its only post is (0,0). On the other hand, if it is homeomorphic to 
{(777, 7i)|l<777<M, l<7i< N} then it has 2M posts, and they are points of 
the form {m, 1) and {m, N). 

In the remainder of this section it will be shown that if a causal set is 
growing according to the transitive percolation model, then infinitely many 
posts will be generated with absolute certainty. 

Consider again the situation of points being added one by one, and denote 
each point by a number. At the moment when causal set consist of exactly n 
points, let a„ be the number of elements of a causal set that are in the causal 
past of the last point added (and it is assumed that every point is in a causal 
past of itself). Thus, a„ = 1 if and only if 7i-th point is a post at the moment 
when it was first added. However, this does not imply that 7i-th point will 
continue to be a post when more points are added; in fact probability of that 
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is rather small. But, the only points of interest are the ones that survive as 
posts to the very end. In other words, point n is selected if, in addition to 
the fact that a„ = 1, all points that are added later will end up being causally 
after n-th point. 

Thus, in order to prove that there are infinitely many posts, two things 
need to be shown: 

1) With absolute certainty, Un — 1 holds for infinitely many n. Thus, 
there are infinitely many points that start out as posts at the moment of their 
creation. 

2) If a given point is a post at the moment of its birth, there is non-zero 
probability that it will continue to be a post forever. 

Part 2 implies that the number of posts is a finite fraction of the number 
of points that start out as posts. Furthermore, part 1 implies that the number 
of points that start out as posts is infinite. Thus, part 2 together with part 1 
implies that the number of points that survived as posts is also infinite. 

Since part 2 is easier, it will be done first. 



LEMMA: if < Cjt < 1 for all k then, for all n > 2 



n 



n 




(2.25) 



PROOF: It is obvious that the above is true for n = 2: 



(1 - ai)(l - 02) = 1 - ai - a2 + 0102 > 1 - ai - 02 . 



(2.26) 
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Now, suppose the above is true for some n. Then 

n+1 n n 

- Ofe) = (1 - a„+i) JJ(1 - afc) > (1 - an+i)(l - ^ Ofe) 

k=l k=l k=l 

n n 

k=l k=l 
n+1 n+1 

= 1 - ^afe + a„+i ^afc . (2.27) 

k=l k=l 

Since > the above naively imphes that 

n n+1 

l[{l-ak)>l-J2^k. (2.28) 

fe=l k=l 

Thus, by induction, the hypothesis is true for all n > 2. QED 

LEMMA: If a given point, n, started out being a post, it will survive as a post 
with non-zero probability. 

PROOF: Let Pk be a probability that point n survived as a post through k 
steps. In other words, at the point when the causal set has n + k points, point 
n is still a post. Furthermore, let pk be a probability that point n will survive 
as a post after the addition of point number n + k + 1 provided that it was a 
post right before that point was added. Thus, 

k-l 



Pk = l[pi (2.29) 



i=l 



To compute pk, notice that 1 — is the probability that point n will stop 
being a post after the addition of point n + k + 1. The latter is satisfied if and 
only if the point n+p-l- 1 is NOT causally related to any of the points between 
n+1 and n + k, inclusively. The probability of that is 1 — = (1 — p)*^, and 
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therefore 



Pk = l-{l-pf 



(2.30) 



and 



fe-i 



(2.31) 



Let j be some number satisfying 1 < j <k. Then, from the previous lemma, 



k-l 



k-1 



Pk = 11(1 - {i-pY) n(i - {i-pY) > 11(1 - {i-pY){i - Y.(i-pY) 

1=1 i=j 1=1 i=j 

-(n(i-(i-rt-))(i- '^"''^'';""'"Y (2.32) 

i=l 



Now, select 



2J 



(2.33) 



This imphes 



Therefore, 



^ (l-p)i-(l-p)fe ^ 1 
P - 2 



^/^>n(i-(i-^^)')' 

i=l 



(2.34) 



(2.35) 



or, substituting the above-given value of j, 



[logi-P f ] 

n> n (i-(i-pr)- 



(2.36) 



Thus, the probability that point n will survive as a post indefinitely satisfies 



the inequality 



[logi_p f ] 

^oo> n (i-(i-pn- 



(2.37) 



i=l 



In other words, it is bounded below by a non-zero constant, as desired. QED 
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LEMMA: If infinitely many points started out as posts, then infinite subset of 
them will indefinitely continue to be posts. 

PROOF: Let the candidates for post be ai,a2, .... For any given k, the prob- 
abihty that NONE of the points a^, j > k will survive as posts is 

lim (1 - P^)" = , (2.38) 

n— >oo 

since Poo > 0. 

Thus, for any given k there exists, with absolute certainty, at least one j 
for which aj survives as a post; call it ji. By setting k = ji one finds that there 
exists, with absolute certainty, a number j2 such that is a post. Likewise, 
by setting k — j2 there exists with absolute certainty a j3 such that aj^ is also 
a post, etc. This implies that infinitely many points survive as posts in the 
case n ^ oo. QED 

Since it was just shown that if infinitely many points start out as posts 
then an infinitely large subset of them will survive as posts, the only task right 
now is to prove that the former is, indeed, the case. 

For any set S, let its future boundary surface, 9future('S') be the set of all 
elements of S that are NOT before any other elements of S] however, there 
might be elements of S to which they are spacelike separated: 

5future(^) = {ieS\'ijeS^{i^ 3)} . (2.39) 

Furthermore, 5'„ is defined to be 

5„ = {l,...,n}. (2.40) 
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Finally, a„ is defined to be the number of elements of the future boundary of 

On = tl^future'S'n ■ (2-41) 

The statement that infinitely many points start out as posts as they appear is 
equivalent to the statement that a„ = 1 for infinitely many n. 

It is easy to see that if the point n + 1 is causally unrelated to any of the 
elements of Sn, then a„+i = a„ + 1. If there is exactly one element of Sn to 
which the point number n + 1 is causally related, then a„+i = a„. Therefore, 
the probabilities corresponding to scenario a„+i = a„ + 1, a„_|_i = a„ and 
Qn+i < On are given by 

prob(a„+i = On + 1) = (1 - p)"" 
prob(a„+i = an) = a^p (1 - p)"""^ 

prob(a„+i < a„) = 1 - (1 + (a„ - l)p) (1 - p)"""^ . (2.42) 

It is easy to see that, as long as p > 0, prob(a„+i < a„) approaches 1 as a„ 
approaches infinity. This means that there exist a fixed number A{p) such 
that prob(a„+i < a„) > 2/3 whenever a„ > A{p). 

LEMMA: if A{p) is defined as above, and > A{p) for some n then, with 
absolute certainty, there will be an m > n such that < A{p). 

PROOF: Suppose, along with random assignments of aj, there are also random 
assignments of bi, and each basic step of that process is defined as follows: 

1) Set i — n and = a„; 
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Then keep repeating 2, 3 and 4 over and over: 

2) Use the prescription above to randomly select Oj+i based on the infor- 
mation about Gi 

3) Define fej+i based on the following rules: 

a) If h < A{p) then bi+i = hi 

b) If bi > A{p) and Oj+i = 0^ + 1 then = hi + 1 

c) If bi > A{p) and Oj+i = Oj then = bi 

d) If 6j > A{p) and a^+i < then can be either 6j — 1 or bi or 6j + 1 
with the following probabilities: 

1/3-{1-pY" 



prob(6i+i = h + l)\a^^ai+i 
prob(6i+i = bi)\a,,ai+i = - 



l-(H-(a„-l)p)(l-p)««-i 
l/3-a„p(l-p)""-i 



l-(l + (a„-l)p)(l-p)""-i 



prob(6i+i ^bi- 1 L. a.^, = ^ -^^^ — -4t^ , ; 2.43 

4) Increase i by 1 and go back to step 2. 



It is easy to see that if no information is available about Oj, then bi obeys 
the following rules of random walk: 

1) bn = a„; 

2) bi < A{p) =^ bi+i = bi; 

3) bi > A{p) =^ p{bi+i ^bi + 1)^ p{bi+i = bi) = p{bi+i ^bi-1)^ 1/3. 
Furthermore, by induction it can be shown that if Uj > A for all j satisfying 
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n- < J < i, then 6j > Oj. This would trivially imply that 6j > A. By 
contradiction, this implies that ii bi = A for some i, this means that aj < A 
for at least one j satisfying n < j < i. Thus, it has to be shown that, with 
absolute certainty, there is, indeed, i > n for which bi = A. 

For any given b, let be the probability that an above-described random 
process started a,t b^ — b will generate bj — A for some j > i. The process of 
going from bto A can be separated into two parts: first part consists of a very 
first step, which has only 3 possible outcomes: 6+1, 6—1 or 6, and the second 
part is a remainder of a journey to A. Prom this construction, it is easy to see 
that probabilities obey the following relation: 



f{A{p)) = 1 ; V6 > A , (/(6) = ^ (/(6 + 1) + /(6) + /(6 - 1))) . (2.44) 



f{A{p)) = 1 ; V6 > A{p) , (/(6) - f{b - 1) = f{b + 1) - /(6)) . (2.45) 



However, since /(6) represents probability, which means that it has to be in 
the range between and 1. This implies that 



The rearranging of the above equation is 



This implies that, for some k, 



V6 > A{p) , (/(6) = f{A{p)) + k{b - A{p))) . 



(2.46) 



A; = ; V6 > A{f{b) = f{A{p))) . 



(2.47) 



Thus, the fact that f{A{p)) — 1 implies that 



V6 > A{p) , (/(6) = 1) . 



(2.48) 
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This means that, with absolute certainty, there will hei > n such that bi — A. 
Now, if Gj > /I is to hold for all j satisfying n < j < i, then, by induction, 
bj > Qj would hold for all j <i. In particular, this would imply that bi> Ui > 
A, which would contradict b^ = A. So, by contradiction, aj = A for at least 
one j satisfying n < j < i, which proves the statement of the lemma. QED 

LEMMA: In the above-described random process starting from any arbitrary 
value of ai, aj = 1 holds for infinitely many is, with absolute certainty. 

PROOF: The first step is to show that Oj < A{p) for infinitely many values of 
i. This can be seen as follows: if there exists n such that < A{p) for every 
single i > n, the answer is self-obvious. So assume otherwise: for every single 
n there exists at least one i > n for which > A{p). In the previous lemma 
it was shown that whenever Oj > A{p), then, with absolute certainty, there 
exists a. j > i such that aj < A{p). Since i > n, the fact that j > i implies 
that j > n. This shows that, with absolute certainty, for every single n there 
exists j > n such that Uj < A{p). This means that there are infinitely many 
such j. 

The fact that there are infinitely many such j means that they form a 
sequence ji, j2, .... I claim that a^^+i = 1 for at least one k. First assume 
that such is not the case. In this case infinitely many statements have to 
simultaneously be true, where statement implies that aj^+i > 1- Each of 
these events occurs with probability 1 — p"^* < 1 — p"^^^ . Since 1 — p"^^^ is a 
fixed number, the probability of infinitely many of such events occurring is 0. 
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Thus, with absolute certainty, this is bound to fail for at least one k. At the 
point of "failure", by definition, Oj^+i = 1. 

The above statement can be weakened: the statement "0^^,+! = 1 for at 
least one k" can be replaced with "a^ = 1 for at least one /' . Now, if that 
process is started with j + 1 it will similarly show that = 1 for at least one 
k > j. Then, again, the imaginary starting point can be moved to A; + 1 which 
will show that ai — 1 for at least one I > k, and so forth. This will generate 
an infinite sequence ki,k2, ... for which = 1. QED 

THEOREM. Transitive percolation generates infinitely many n with absolute 
certainty. 

PROOF. Every single k for which ak — 1 corresponds to a post. Thus, the 
fact that there are infinitely many such k shows that infinitely many points are 
starting out as posts. Furthermore, it was previously shown that if infinitely 
many points start out as posts, then their infinitely large subset will survive 
as posts. Thus, infinitely many posts will be generated. QED 

2.6 Generating Causal Sets: Using Statistics or La- 
grangians? 

In the last section, it was shown that if causal set was generated according to 
the transitive percolation model, this would imply infinitely many "big bangs" 
in the form of posts. While the particular model presented is certainly not 
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the only one that has been tried for causal sets (albeit the most common and 
studied one) other models that have been tried still lead to similar conclusions. 

It is very likely that this is an obstacle that prevent causal set from being 
manifold-like, especially since much of numerical work predict posts to be 
spaced close to each other. It is possible to hope that by setting the probability 
of causal relation to be extremely low, posts will not be so close, and perhaps 
the manifold structure will appear in the regions between them. 

Furthermore, if gravity can indeed be generated statistically, perhaps there 
is connection between posts predicted on statistical bases and the successive 
big bangs due to universe collapse. However, while the above possibilities can 
not be completely ruled out, extensive numerical work shows that such does 
not happen. Again, it is possible that there might be some other stochastic 
processes that do generate desired manifold, but so far such was not done. On 
the contrary, the results obtained up till now indicate that as the size of causal 
set grows to infinity, the probability of it being manifold-like goes down to 0. 

I believe that the reason for this problem is that typically physics starts 
from experimental data and then the theory in one way or the other adjusts 
to that data, even though it pretends that it doesn't. This did not happen 
for sequential growth model. In order for the stochastic process to have ex- 
perimental support, one has to first find all of its results and then verify that 
these results match the observations. While, strictly speaking, nothing rules 
out the possibility that such might happen at some point in future, so far it 
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hasn't, and it would be very risky to try to stick to a model on a mere basis 
of the hope that one would get lucky at some point. 

However, there were examples where, as risky as that might have been, 
the theory ended up being successful despite the fact that its starting point 
was not motivated by experiments. One of the prime examples of this is the 
classical thermodynamics, where PV — nRT based on statistics and momen- 
tum conservation alone. The spirit of causal set theory is that the spacetime 
itself is broken into atoms, much like gas is. Thus, it is hoped to "simulate" 
gravitation stochastically so that one can say that gravitation to stochastic 
process is the same as PV = nRT to conservation of momentum. 

Of course, if one can actually invent a stochastic process that meets the 
above criteria, it would be a much better and more beautiful theory than the 
one proposed in this dissertation. The problem, of course, is that so far no 
one was able to invent such a process. For this reason, the approach proposed 
in this dissertation directly utilizes Lagrangians and abandons the statistical 
method altogether. Of course, this might look ugly, since it is similar to a 
statement that gas molecules are "choosing" their velocities in just the right 
way so as to "force" PV — nRT to be a statistical average. But, as ugly as it 
might be, it is a safer option since the Lagrangians used do, in fact, go back 
to experimental observations, albeit on scales much larger than the ones of a 
causal set. 

There is, however, a philosophical argument to justify this. Consider, for 
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example, real numbers. At first glance, they appear to be very simple. How- 
ever, their rigorous definition is not as simple: they are defined as Dedekind 
cuts of the rationals (here, a cut is defined as a set -S" C Q such that for any 
two rational numbers a and 6, if a e 5' and b < a then b & S). While the 
definition is complicated on the surface, it is no longer questioned once its cor- 
respondence to real numbers is understood. This is because we are believers 
in real numbers and not believers in the cuts. So, as long as cuts "serve our 
agenda" , it makes perfect sense why they were "invented" . 

However, if one hadn't experienced real numbers in everyday life, one 
would have been forced to be a true believer in cuts, which would lead to a 
similar question: how come the Dedekind cuts happened to be defined in this 
particular way rather than some other way? Perhaps, one would have decided 
to make some random generation of a cut and then show that the specific 
structure that corresponds to real number is the most probable one. This 
would lead to disappointment, since such is not the case. 

What the above example teaches is that existence of real numbers is, too, 
an experimental observation, since the specific definition of Dedekind cut is 
not the most probable one stochastically. Yet, due to the fact that we have 
to "observe" that over and over within seconds, we became so used to that 
observation that it appears to us as "math" . Similarly, it can be claimed that 
a causal set, too, is generated by physics rather than by math. But due to the 
fact that we have hved inside of the causal set for such a long time, we got so 
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used to it that it appears to us as math. 

Since we have only experienced the causal set on a very large scale, the 
only part that appears to us as math is its large-scale geometry. If, however, 
we were to live on a microscopic scale for just as long, then what happens on 
the level of, say, 10 points, would also appear very natural to us and it won't 
even occur to us to ask whether or not it is the most probable stochastically; 
we would regard it as "simple geometry" even though it would have nothing 
in common with the geometry we are used to. 

2.7 Causal Set as a Static Geometry 

In Section 2.5 it was shown that the transitive percolation model for generating 
causal sets leads to the unpleasant fact of the presence of too many posts and 
lack of manifold structure. Then in Section 2.6 it was suggested that perhaps 
the answer to the puzzle is to replace statistics by a Lagrangian. In this 
section, a Lagrangian-based model of causal set structure will be proposed 
that will be used for the dissertation. A necessary component of this model 
is the "collapse" of fluctuating causal relations into a well-defined structure. 
The meat and justification for the specific way of doing that will be extensively 
discussed in the Appendix. For now, it will suffice to simply outline the setup. 

According to the proposed model, both fields and causal relations are 
undergoing quantum fluctuations within a certain tunnel around their "cen- 
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tral" configurations. The size of that tunnel is roughly at the transition point 
between quantum and classical scales. For any specified behavior of fields 
throughout space time, J^o{s), there is a "neighborhood" n{J^o) that consists 
of all possible variations of these behaviors, T — + SJ-', that are regarded 
as "similar" to J-'o by some specified criteria. 

For example, for a scalar field 0, the "(5-neighborhood" around (f){x) = 
(f)o{x) is given by 

ns{<f>o) ^{<f>:S^R\ypeS, \<f>{p) - Mp)\ < ^} ■ (2-49) 

The definitions for neighborhoods of other fields will be discussed in section 2 
of the Appendix. 

One important aspect of the theory is that the notion of neighborhoods 
is also defined for the set of all possible causal relations, i.e., for gravitational 
fields. That is, for any causal relation there is a neighborhood n(-<o) 
consisting of all causal relations, that are "similar to" ^o- Again, the 
actual definition of that neighborhood is left for the Appendix. 

To every {J^q, -<q) corresponds a probability amplitude given by 

Z(^o,^o)= I [VTW^^^^K (2.50) 

For the sake of mathematical rigor, the values of T can be discretized at every 
point and the integration over T can be replaced with a sum. For example, for 
the simple case of the universe consisting of scalar field and gravity by itself 
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(in other words, T — (f)), 

ZMo)-{er E E ^'"^'"'^ (2-51) 

<ens{<o) 4>en{(po) ; yp{4>ip)=ke) 

where 

715(00) = {0: 5 ^ R I Vp e 5 , |0(p) - 0o(p)| < S} . (2.52) 

The constraint that the relation -< belongs to n(^o) will provide the back- 
ground geometry that the theory used to lack. It is important that J-' is 
constrained to n{J^o) as well, since the behavior of T plays a role in Einstein's 
equation. 

This defines a probability amplitude determined entirely based on a La- 
grangian and not on a stochastic process. This by itself by no means implies 
a manifold-like structure, unless the Lagrangian is designed in a way that it 
would. However, by restricting fluctuations of gravitational fleld to a spedified 
range, the argument against the manifold structure is removed as well, which 
is a first step along the way. 
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3 Chapter 3: Type-1 Bosonic Fields 



3.1 Fields on a Causal Set 

In order to define Lagrangians it is important to first define fields on a causal 
set. Since scalar field is defined without reference to manifold structure, it's 
definition can be borrowed from a regular manifold case, replacing manifold 
A4 with causal set S: 

DEFINITION: A real scalar field is a function 0: 5" ^ R and complex 
scalar field is a function 4>: S ^ C 

A vector field is trickier to define, and its definition is discussed extensively 
in section 2.3. Here, it will simply be borrowed: 

DEFINITION: A vector field on a causal set 5 is a function v. SxS 

Putting these two definitions together, a general definition of a bosonic 
field in a manifold combines both scalar field, : 5" — > C as well as vector field 
v: 5 X 5 — >• M is as follows: 

DEFINITION: Let 5 be a causal set. A field on S is either / : 5" ^ M or 
/: -S" ^ C, where A C VS. 

The key here is that these fields are all real valued functions, thus are well 
defined for arbitrary causal set. 
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3.2 Type-1 Lagrangian Generators 

Our next goal is to define Lagrangians for causal sets. In the case of a lattice 
field theory, the Lagrangian density at any given point is defined in terms of the 
field at that point as well as at the neighbors of that point, where "neighbor" 
is one of the 2d points that are directly hnked to the point of interest by the 
square lattice. In the case of a causal set, however, the notion of a neighbor is 
no longer well defined. 

Even in the special case of a manifold, due to lightcone singularity, the 
region whose Lorentzian distance to any given point is bounded by e has in- 
finitely large volume. Since being connected by a "direct link" is equivalent 
to having Lorentzian distance equal to Plank scale, this means that any given 
point is connected by direct links to infinitely many other points, which qual- 
ifies all of them as possible candidates for "neighbors" . 

For these reasons the Lagrangian cannot be defined in terms of a hnear 
superposition of some function of all possible neighbors. However, it is still 
possible to define a "Lagrangian generator" which will be a function of all 
possible choices of a few-point collections of neighbors. Since these collections 
contain more than one point, there is no more need to define any further 
"neighbors" of points that are being considered, thus the above difficulties are 
avoided. 

Furthermore, a standard procedure will be introduced that dictates the 
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definition of a Lagrangian based on the Lagrangian generator. Roughly speak- 
ing, that procedure is designed to select "representative" collections of points 
that would correspond to the extremum of the Lagrangian generator with re- 
spect to some well defined constraints. It is shown by the examples of well 
known fields that the value of Lagrangian generator at that select set of points 
is, in fact, proportional to what is usually defined as Lagrangian density. 

Consider the example of electrodynamics. In order to be able to define 
the Lagrangian, one has to define electric and magnetic field in terms of vector 
potential. Due to the lack of a coordinate system, it is not possible to write 
down the definition of electromagnetic field directly. However, some informa- 
tion about electric or magnetic fields, albeit very insufficient, can be extracted 
from any triple or quadruple of points: an integral of vector potential around 
the loop defined by these points. In principle, a value of electromagnetic field 
at a point can be deduced from the values of the field at all the possible loops 
containing that point. 

Lagrangian generator is defined a function on set of loops rather than set 
of points, and it will be defined based on circulation of electric and magnetic 
fields around these loops. After that, a standard procedure will be introduced 
to go from Lagrangian generator on set of loops to Lagrangian density on set 
of points. 

The set of such loops is identified with S^. In generalizing from specific 
example of electromagnetic field to a general case, is generalized to S\ This 
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leads to the following definition: 

DEFINITION: A Lagrangian generator is a function K.: D x Tgk-^c K. 
Here, D is a. subset of for some i and Tx^y denotes the set of all functions 
j:X^Y. 

It can be shown that as long as constant distances are used as opposed to 
constant volumes, the set has sufficiently many elements: 

THEOREM: Suppose p ^ q and t(p, q) = A. Then, for any B < A there 
exist r )^ p and s ^ q such that r(p, r) = r(s, q) = B 

PROOF: By definition, there exist a sequence of points p ^ ri -< ... ^ 
ta-i -< q and this is the longest possible future directed chain of points con- 
necting p and q. Thus, p-<ri-<... -<rBisa chain of points connecting p 
and tb-i of length B. Suppose there is another chain of points connecting p 
and Tfi, p ^ ai ^ ... ^ ac -< tb- Then this gives another chain of points 
connecting p and g, p -< Oi -< ... -< ac ~< tb -< tb+i -< ■■■ -< ta-i -< q The 
length of this chain is A + C — B. Its length has to be smaller or equal to 
the length of original chain: A + C — B < A. This implies C < B. Since 
this argument works for arbitrary p ~< ~< ... ~< ac ~< tb, this implies that 
t(p, rs) < B. At the same time, p -< ri -< ... -< implies t{p, tb) > -B, which 
means T{p,rB) = B 

Identical argument, replacing -< with y, p with q and with VA-k implies 
that T{rA-B: q) = B. QED 
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In case of gauge field, J^s^^^ is a set of all possible gauge field holonomies. 
Thus, )C: X J-'a^r defines a rule by which one can start from gauge field 
holonomies and get a real number corresponding to each square loop: 

^(a;Pi,P2,P3,P4) = (a(pi,P2) + a(p2,P3) + a(P3,P4) + a(p4,Pi))^ ■ (3.1) 

That real number is interpreted as the flux of the gauge field through that 
loop. 

The procedure of going from Lagrangian generator to Lagrangian will be 
puzzling at first, but at subsequent sections it will become clear why it is 
defined in a way that it is: 

DEFINITION: Let /C : D x W x Ta-.c ^ M be Lagrangian generator; further- 
more, suppose that p -< q. Then 

lC^^{f;p,q)^mB,x{)C{T,f)\TeDUa{p,q)y} 

iCu^Mp, q) = min{/C(r, f)\TeDU {a{p, q))'} (3.2) 

^^{f;P, Q) = A^max(/;P, q) - /Cmin(/;P, ?) 

The only Alexandrov sets a{p, q) that are relevant for the Lagrangian density 
at X are the ones that minimize A/C with constraints that x C and t{p, q) = Tq. 
The set of these Alexandrov sets a{p, q) corresponds to a set of pairs of points 
p -< q, and is formally defined as follows: 

DEFINITION: Let /C : D x W x JF^^c ^ K be Lagrangian generator and let 
X be any element of S, then 

ar{x) — {p, q E S \ p ~< X ~< q ; a{p, q) — r ; 
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Vr, s{r -<x -<sA a{r, s) = r ^ A/C(/; r, s) > A/C(/; p, q))} . (3.3) 

In most cases a{x) is a one-element set, but it is defined as a set in order to 
accommodate a few cases wliere it isn't. 

Finally, the Lagrangian density based on )C is given by 

C^M. , (3.4) 

and the total Lagrangian is given by 

S^.,r{f)=V,Y,C^,r{f.^). (3.5) 

where vq is a volume taken up by one point, which is often assumed to be 
Plank volume. 

While the above is done for arbitrary r, it is assumed that in reality r is 
equal to some constant Tq. Its value, in principle, can be measured although 
is currently unavailable. The Lagrangian is given by 

Sfc = Sfc-T, ■ (3.6) 

It is important to stress that in the same way as in usual quantum field 
theory writing down a theory of a particular field is equivalent to writing down 
its Lagrangian, in the same way in causal set case, writing down a theory is 
equivalent to writing down Lagrangian generator. However, in light of the 
fact that the transition from Lagrangian generator to Lagrangian density is 
non-linear, it is not correct to simply add all the Lagrangian generators into 
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one single generator. Instead, they have to be hsted as sets. Thus, 

n 

^ = {^1, ■■■,^n} , C,K = ''^^Ki- (3.7) 

i=l 

For example, in case of charged spin-0 particle interacting with electromagnetic 
field, the Lagrangian generator is given by /C = (/Ci, /C2, /C3, /C4, /C5) where 

A^2(p2,g2;0,a) = -Ci\a{p2,q2)<p{q2) -0(^2)1^ 
^3(^3,53, '^3,53) = C2(a(p3,g3) +a(?3,r3) + a(r3,S3) + a(s3,p3))^ 
K,a{pa, r4, ^4, Si) = -C2(a(p4, ^4) + 0(^4, r^) + a(r4, S4) + 0(54,^4))^ 
/C5(r, s) = m2(r s)((/)(r) + (t){s)f (3.8) 

(here r-<''s = lifr-<s and r -< s = otherwise) 

under the following constraints: 

1) Pi ^ ?i ; 

2) p2 and 52 are space-like separated from each other; 

3) P3, ?3 , '"3, and S3 are spacelike separated from each other; 

4) p4 ^ r4 ^ gf4 and ^4^54^ gf4. 

CLAIM: Let 5* be a Poisson distribution of points in a Lorentzian manifold 
M.. Furthermore, suppose there are scalar and gauge fields on A4, which are 
differentiable and vary slowly enough. 5" is viewed as a causal set with respect 
to causal structure inherited from 7W. Scalar and gauge fields are "inherited" 
into that causal set in the form of a scalar field and a holonomy. If that 
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Poisson distribution is dense enough, then Lagrangian densities of the scalar 
and vector fields on Ai as defined in regular quantum field theory approximate 
the Lagrangian density defined based on Lagrangian generators on S. 

The proof of the vahdity of this claim is the subject of the next two 
chapters. 

3.3 Type-1 Scalar Field 

Assuming that there is no interaction with electromagnetic field (thus, set- 
ting a(r, s) — 1) Lagrangian generators for the scalar field, corresponding to 
timelike (t) and spacelike (s) pairs of points are 

]Ct{rt, St; (f) = ct(0(st) - (t>{rt)f 
ICsivs, Ss] (j)) = -Cs{(j){rs) - (t>{ss)f 

/C^(r, s, ct>) = ^m\r ^- s)(0(r) + cp{s) f (3.9) 
under the following constraints: 

1) n-<st] 

2) Ts and Ss are space-like separated from each other; 

Lets first start with mass term, /C^. It is easy to see that if (j) is roughly 
the same inside an Alexnadrov set, then the variations of \ (r s)(0(r) -|- 
(t>{s)Y are mostly due to r s changing from to 1. Thus, the desired varia- 
tion is roughly equal to |m^0^, which corresponds to mass term of Lagrangian. 
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It is also easy to see that if were to vary, the variation would be larger rather 
than smaller. Thus, indeed, the desired mass term corresponds to minimized 
variation. 

Now lets move on to Kt and /C^. The prescription to go from K, io C 
requires the minimization of the maximum variation of <p between two points 
inside it: 

C = min max {(f){r) - (3.10) 

r(p,(j)=To r,s&a{p,q) 

This means that the problem splits into four parts: 

1) K, — K-t ; The gradient of is timehke 

2) JC — JCt, The gradient of (f) is spacelike 

3) K, — ICs] The gradient of is timehke 

4) K, — K,s-i The gradient of is spacehke 

CASE 1: K, — ICt ; The gradient of (f) is timehke. 

Let's start from /Ci — > £1. In other words, the pairs of points of interest 
are causally related. 

Choose a coordinate system in which the gradient of points in the t 
direction, while the remaining coordinates x'^ vary along spacelike directions 
normal to 5^0. That is, = 0. If r and s are elements of the Alexandrov set 
ot{p,q) then |r° — s°| < g° — thus \(t){r) — 0(<s)| < \(t){q) — 4'{p)\- Thus, the 
maximal variation of the scalar field inside the Alexandrov set a{p, q) is given 
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by 10(9) — But due to the assumption that the gradient of points in 

the t direction, the latter is proportional to g° — p'^. Thus, the variation of 
inside the Alexandrov set a{p,q) is minimized whenever g° — is. If the 
constraint r(p, q) = Tq is imposed, the above minimization implies p^ — q^ = 0, 
or in other words the direction of the line passing through p and q should 
coincide with the direction of the gradient of 0. Thus, 



£1 = c min max (0(r) - 0(s))^ ^ {5tf{do(t)f = {Sifd^cPd^^ . (3.11) 



Even though is a finite version of infinitesimal, due to the fact that it 
is constant, it is absorbed in an overall coefficient, which might be very large 
and thus produce finite result. Thus, the Lagrangian takes the expected form. 



where dt = Ct{St)'^. 

It would have been simpler if p and q were used instead of r and s. After 
all one can predict ahead of time that the maximization criterion will select 
r = p and s = q. The reason the above expression is left in r-s form is that 
the p-q simplification would not work in the case of a spacelike gradient of 
and for the purposes of consistency of the theory it is preferred that the 
expressions for timelike and spacelike cases be the same. 

It can be shown (see [8]) that in dimension d the volume of an Alexandrov 
set is 



Ci ^ dtd^'(f)d^(j) , 



(3.12) 



(d-l)/2 



V{.a{p,q)) = kdT'^{p,q) , kd : 



d{d-l)2'i~^T{{d-l)/2) ■ 



(3.13) 
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Thus, the above expression can be rewritten as 



|9o0| = ( 77 niin max |0(r) - (f)(s)\ , (3.14) 

1^0 / T(p,q)=To r,sea{p,q) 



and its covarient generahzation is 

dyd^<P=(-^] min max (0(r)-0(s))^ (3.15) 

\ ►'O / V(a{j>,q))=To r,sea{p,q) 

CASE 2: K, — }Ct', The gradient of is spacehke. 

Choose coordinate system in such a way that gradient of points in x 
direction. It will be shown that, again, the Alexandrov set that minimizes 
fluctuations is the one whose axis points in the t direction (although there is 
a lot of freedom of choosing t axis in a way that x axis still coincides with the 
direction of gradient of 0) . 

This can be done in two steps: (a) Show that if the axis of Alexandrov 
set coincides with t axis then the sought-after fluctuation is equal to |t^|90P, 
and (b) Show that in all other cases fluctuation is greater or equal to r^. 

(a) Assume that the axis of Alexandrov set is parallel to t axis, and suppose 
p ^ r ^ s ^ q. The goal is to show that Ir^ — s^] < r/2. Assume otherwise. 
That is, assume \rx — Sx\ > t/2. Applying that to right hand side of t/2 = 
r — r/2 gives 

I = r - ^ > r - (st - n) = - St) + (^ + = (g* - St) + in -pt) . (3.16) 
The causal relations p ~< r and s ~< q respectively imply qt — St > \sx — qx\ and 
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rt-Pt> \Px - r-xl which gives 

7- 

^ > \sx - Qxl + \Px - rx\ (3.17) 
Implementing the fact that Px = Qx = ^ , the above inequahty becomes 

T 

^ > \sx\ + \rx\ = \sx\ + \ - Txl (3.18) 
By the triangle inequality, this implies 

^>\sx- rx\ , (3.19) 

as desired. 

(b) Now suppose that the axis of Alexandrov set is tilted with respect to 
t axis. If it is tilted in yz plane, then the coordinate system can be rotated so 
that in new coordinates it is again parallel to t axis leading to previous result. 
Thus, the only situation of interest is when the axis of Alexandrov set has x 
component in which case it can not be gotten rid of since x axis is set to be 
parallel to gradient of 4>. In this case, again rotate coordinate system so as to 
get rid of y and z components of Alexandrov set, thus its axis lies on xt plane. 

Consider two lightlike lines, li passing through p in x + t direction and I2 
passing through q in x — t direction. They are given as 

^1 = {(oi + pt, ai + px, 0, 0) I oi e R} 

h = {(% - 02, 02 + Qx, 0, 0) I 02 e M} . (3.20) 

Let s be the unique elements of h 0/2- The cases of r = p and r — 2 imply 
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that 

sup{rt - St\r,s e aip,q)} > meix{\st - Pt\, \st - qt\} 

> max{|ai|, |a2|} . (3-21) 

The values of ai and 02 can be derived by treating the respective equahties 
of non-zero components of h and I2 as a system of two equations and two 
unknowns: 

ai+Pt = qt-a2; ai+p^ = a2 + qx ■ (3.22) 
This can be rewritten as 

ai + a2 = qt-Pt; ai - (^2 = Qx - Px ■, (3.23) 

which gives 

oi = ^ {(It -Pt + Qx-Px); «2 = ^ (qt -Pt+Px- Qx) ■ (3.24) 

Thus, 

sup{rt - St I r,s e aip,q)} > max{|st \st - qt\} 

> max{|ai|, |a2|} = ^ (g* -pt + \qx - Px\) 

>\i^t- Pt) > ViQt-Pty - {qx-PxY 
^\r{p,q). (3.25) 

But in part (a) it was shown that for the case p^ — qx the above supremum 
is exactly equal to r/2. This, combined with the fact that gradient of is 
parallel to x axis, implies that Alexandrov sets satisfying Px = qx minimize 
the fluctuations of interest. 
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This means that the — Qx Alexandrov set will be selected as the repre- 
sentative one in defining the Lagrangian, which implies that 

A = ^ {d,cf>f = -^d^cpd.cP = -^d'^cpd.cP . (3.26) 

CASE 3: K, — ICs'-i The gradient of is timehke. In this case, just hke for the 
case 1, the minimizing Alexandrov set is the one whose axis is parallel to the 
gradient of 0. This will be shown by similar steps as in case 2: in step a the 
fluctuation will be computed for that specific Alexandrov set, and in step b it 
will be shown that all other Alexandrov sets have larger fluctuations. 

a) Let proj v denote the projection of v on the xyz hyperplane. The goal 
is to show that whenever Sq — Tq > t/2, r and s are timelike related. This 
automatically implies that for any spacelike related r and s, |so — ?^o| < t/2. 

Assume sq — tq > t /2. This implies 

- t/2 < ro < < So < t/2 . (3.27) 

Therefore, 

|projr| < r/2 + ro < r/2 + so-T/2 = r/2-(r/2-so) < r/2-|projs| . (3.28) 

By the triangle inequality, this implies 

|proj (r — s)| = |proj s — proj r| < |proj r| + |proj s| < t/2 . (3.29) 

But the fact that |proj {r — s)\ < t/2 combined with the fact that St — Vt > t/2 
implies that r -< s. Therefore, if r and s were spacelike separated then, by 
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contradiction, the original assumption that Sf — rt > t/2 has to be wrong, i.e. 
\st~rt\ <t/2. 

b) In part b of case 2 it was shown that if Alexandrov set is "tilted" then 
one can produce a point s that is lightlike separated from both p and q such 
that its coordinate difference with one of the two points is greater than t/2. 
The identical argument carries through to the situation at hand. The only 
modification is that instead of using p and q, one has to use p + e{q — p) and 
q — e{q — p) in order to ensure that points of interest are spacelike separated. 
The e ^ hmit will imply that the fluctuations are greater than t/2. 

The Lagrangian predicted from this generator is 

Cs = ^(^00)^ = "-^3^4^8,4^ = jd^4>d,4> (3.30) 

CASE 4: JC = ICs] gradient of 4> is spacelike 

In this case, start by selecting a frame in which the gradient of 4> points 
along the x axis. That is, for any Alexandrov set a{p, q) select a frame in 
which p and q lie in the t-x plane. By considering the cases Px < < q^ 
and < < Px, in both of which pf^ = —q^, it is easy to see that the two 
intersections of the Alexandrov set with the x axis lie on — — | g^; | and + 1 g^ | ■ 
Since the gradient of points in the x direction, the variation of (p is minimized 
whenever g* + |ga;| is. But qt is bounded below since the distance between the 
end points of Alexandrov set is fixed. This implies minimization of \qx\- This 
requires q^ = 0. Since it is assumed that q lies in the t-x plane, this means 
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q — 0, 0, 0) and p'' = —qi^ implies p — (— 0, 0, 0). The intersections of 
the Alexandrov set with the x axis are at a; = — |t(p, g) and x = |t(p, g). 
Thus, 

= Cs{d^(t>? = -Cst\p, q) {d^'(t>d^(t> = -dsd^(t>d^(t> . (3.31) 
By combining all four cases, the total Lagrangian based on {ICt,ICs) is given 

by 

{dt + ^)d>'(l)d^(l) if d>'(t)d^(t)>Q 
C^l (3.32) 

-(4 + f )a^0a/,0 if ^^'(i)^^(i)<Q . 
Since whether or not d^(f)dn(f) greater than is relativistically covariant, the 

above definition of Lagrangian is relativistically covariant as well. Thus, in 

principle, there is nothing wrong with the above Lagrangian, except for the 

fact that, of course, such was not observed in the lab. However, the observed 

Lagrangian is a special case of above for Cg — —Ct or, equivalently, dg — —dt- 

Thus, such will be assumed from now on. 



3.4 Type-1 Gauge Field 

We would like to define Lagrangian generator for electrodynamics as a flux 
through the loop defined by a set of points ri, : 

/C„(a; ri, r„) = a(ri, r2) + a(r2, r^) + ... + a(r„_i, r^) + a(r„, ri) (3.33) 

Naively, it is tempting to consider flux through the triangular loop, a(ri, r2) + 
0'{i^2,'>^3) + dii^ 3,1^1) ■ However, there is a problem: the equator of an Alexan- 
drov set is spherically shaped, while its intersection with, say, the x-t plane is 
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not. Thus, the maximal area of the triangles lying in the intersection of the 
Alexandrov set with the x-y plane is not the same as the maximal area of its 
intersection with the x-t plane. 

On the other hand, if fluxes are taken over rectangles, a(ri, +a{r2, r^) + 
O'i'i^s, ^4) + a(r4, ri), it will be shown that in both cases the area is maximized 
by a square. In the spacelike case, the square will lie on the equator while 
in timelike case only two of its points will lie on equator while the other two 
points will lie at the poles. These two kinds of squares look identical to each 
other except for the 90 degree rotation, which means that they have the same 
areas as well. For this reason, Lagrangian generator is given by 

/C = /C4 (3.34) 

One might first ask the following question: if a causal set is inherently rela- 
tivistic, doesn't it mean that the answer is covariant no matter what shape 
of the contour is chosen, as long as the procedure is covariant? If so, why 
wouldn't that apply to /C = /C3? The answer to the covariance question is 
yes, but we have to be more careful as to what is meant by relativistically 
invariant. 

Strictly speaking, relativistically invariant means it doesn't change under 
rotations and boosts. Now, there is no rotation or boost that would take 
a spacelike vector into a timelike vector, or vice versa. Thus, we have two 
different answers for the spacelike and timelike contours, without violating 
relativistic invariance. However, despite the fact that this would be an invari- 
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ant answer, it is not an answer that agrees with experiments. So, in order to 

get an answer that does, we chose to use rectangles instead of triangles. 

While it might be interesting to investigate more general domains of the 
Lagrangian generator in the future work, for the purposes of this thesis we 
would select the domains that would save our time the most: 

D = {(ri, r2, si, S2, S3, S4) \T{ri, ra) = Tq; 

ri ^ Sfe ^ r2 ; r(ri, Sk) + r(sfe, < e} (3.35) 

where e is some small number. The constraint T(ri,r2) = tq implies that at 
the step of going from Lagrangian generator to Lagrangian, the points ri and 
r2 will be forced to coincide with p and q respectively. It is also easy to see 
that the constraint T(ri, Sk) + T{sk, r-i) < e is equivalent to saying that points 
si through S4 lie very close to the surface of equator of Alexandrov set. 

We will have two Lagrangian generators, the timelike and spacelike one: 

si, S2, S3, S4) = si) + a(si, r2) + a(r2, S2) + a(s2, ^i))^ (3.36) 

>Cs(ri, r2, si, S2, S3, S4) = (a(si, S2) + a(s2, S3) + a(s3, S4) + a(s4, si)f (3.37) 
Since /Cj is independent of S3 and S4, we will simply write it as 

^t{ri-, r2, si, S2) = (a(ri, Si) + a(si, r2) + a(r2, S2) + a(s2, ri))^ (3.38) 

From the fact that ri and r2 coincide with poles of the Alexandrov set while s^ 
lie on the equator, it is easy to see that in the reference frame defined by that 
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Alexandrov set Ct measures the electric field while Cs measures the magnetic 
one. 

It is easy to see that 



Now lets talk about Cs- Choose a reference frame in which B points in z 
direction. The flux through the loop is proportional to the area of a rectangle 
which is formed by the projections of points ri, rg, and r4 on xy-plane. 

That area can be computed as follows: we connect each of these 4 points 
to the origin. This would break the picture into 4 triangles. The angles of 
the two adjacent lines of the triangle that meet at the origin are 6'i, 62-1 O3 
and 04. Each triangle can be further broken into two triangles by drawing the 
perpendicular line from the origin to the line connecting the opposite side of 
that triangle. The area of each of the two pieces is 



/Ct(p,g,r,s) = {E- (f - s}f 



(3.39) 



which means that it varies from to I^Pr^. 



Id r cos 1 9i) (| r sin | 9i) 



i sin Oi . 



(3.40) 



Thus, the area of the whole thing is 



4 



4 




(3.41) 



1=1 



i=l 



Thus we would like to maximize Yl'^=i ^^i' with the constraint that Yl'l^i Oi — 



27r. 
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This means that the gradient of Ylt=i should be parallel to the gra- 
dient of Yli=i — 27r. In other words, there is a constant c such that 




(3.42) 



This implies that 

cos6'fe = c . (3.43) 

In other words, all angles are equal. Since their sum is 27r this means that they 
are all equal to |. So this can be accomplished by putting ri and at the 
intersections of the x axis with the boundaries of the Alexandrov set, and r2 
and r4 at the intersections of the y axis with the boundaries of the Alexandrov 
set. This tells us that JCg varies between and \B\'^t'^. 

Thus, the variations of JCt and JCg Sire minimized in reference frame that 
minimizes \E\ and \B\ respectively. 

— * — * 

I claim that that frame coincides with the one in which E and B are 
parallel. In order to show that, we have to first show that such frame exists, 
and then, while working in that frame, we have to show that magnitudes of E 

— * 

and B will be larger in any other frame. 

— * — * 

Let's start with finding a reference frame in which E and B are parallel 
to each other. We start from the reference frame where they are not, and then 
find the kind of Lorentz boost that would bring us into the frame where they 
are. In the original frame, rotate coordinate system in such a way that 

E^^B^^O (3.44) 
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Our guess is that the boost in z direction will bring us into the reference frame 
where E and B are parallel. In the boosted frame, the electric and magnetic 
fields written in a form of two separate space-only vectors are 

E' = (0, ^{Ey - vB,),^{E, + vBy)) (3.45) 

B' = (0, -i{By + vB,),^{B, - vBy)) (3.46) 

Thus, in order for them to be proportional to each other the following equation 
needs to be satisfied: 

Ey - vB, ^ E, + vBy 

By + vE, B,-vEy ^ ' ^ 

This can be re- written as a quadratic equation in v: 

v\ByE, - B,Ey) + v{El + El + Bl + B]) + E,By - EyB, (3.48) 

Its solutions are 



-El - El - Bl - Bl - J {El + El - Bl - Blf + A{EyBy + E^B^f 

V\ — 

2{EzBy - EyBz) 

(3.49 ) 

- El - El - Bl - Bl + ^{El + El - Bl - Blf + A{EyBy + E^B^f 



V2 



2{EzBy - EyBz) 



which, in light of the fact that E^ — B^ — 0, can be rewritten as 



(3.50) 



_|^|2 _ |5|2 _ ^(|^|2 _ |5|2)2 + . 5)2 
Vi = — :^ (3.51) 

2\BxE\ 

_|i|2 _ 1^12 ^ ^/(|i|2_ |5|2)2 + 4(^.^)2 

V2 = -— — ^ J J ^ (3.52) 

2\BxE\ 

However, in order to either of these solutions to be physically valid, they have 
to be between —1 and +1, which is what I am about to check. 
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Let's start with Vi. We notice that 

< (1^1 - \B\y = + - 2|^||S| (3.53) 

This imphes that 

\E\\B\ < -{\E\^ + \B\^) (3.54) 

We also know that 

\BxE\<\E\\B\ (3.55) 

which imphes that 

\E X B\<h\E\'' + \B\'') (3.56) 

Therefore 

vi < - ' ' ^' < -1 3.57 
2|Sx£;| 

which means that V\ is not physically valid. 
Now let's look at V2- We notice that 

{\E\^-\B\^f^^{E-Bf < (|^|2-|S|2)^ + 4|^HS|2 = {\E\'^ + \B\Y (3.58) 

We also konw that 

{\E\'^ - \B\Y + 4:{E ■ Bf > (3.59) 
Together, these imply that 

0< ^(|i|2- |S|2)2 + 4(^-S)2 < + (3.60) 

which implies that 

< (3.61) 
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So V2 is physically valid if and only if it is greater than —1. We will denote the 
inequalities that need to be satisfied for that to happen by a question mark: 
<'' and >''. Thus, by noticing that denominator is positive, we have 

^J{\E\^ - |5|2)2 + 4(E-5)2 - \Ef - \B\^ >■ (3.62) 



>• -2\E X B\ 



which is equivalent to 



^J (I^P - |B|2)2 + 4{E ■ BY >■ \E\^ + \B\^ - 2\E x B\ (3.63) 
Squaring both sides implies that 

{\E\^ + \B\^f + A{E ■ Bf (3.64) 
>• (|E|2 + 1512)2 ^ ^ _ ^ E|(|E|2 + 
After expanding squares on both sides this becomes 

1^1^ + \B\^ - 2\E\^\B\^ + A{E ■ Bf >■ 

>■ 1^1^ + + 2\Ef\Bf + A\B X Ef - A\B x E\{\E\^ + \B\^) (3.65) 

After cancelling and terms and moving things around between left 
and right sides, this becomes 

4(i • Bf - 4|S X Ef + A\B X ^Kl^l^ + l^j^) >• A\Ef\Bf (3.66) 

which, factoring out 4, is 

(^ • S)2 -\Bx + |S X i|(|^|2 + \B\^) l^n^l^ (3.67) 
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— ♦ — ♦ 

Let's now express things in terms of the angle 9 between vectors E and B: 

|S X ^1 = |S||^||sin^| (3.68) 

and 

{E-Bf -\Bx = \E\'^\BWcos'^ e - sin^ 6) = \E\'^\B\\os{2e) (3.69) 
Substituting these we obtain 

\E\^\B\^cos{29) + \E\\B\{\E\^+\B\^)\sm0\ >■ \E\^\B\^ (3.70) 
which, after factoring out becomes 

cos(2^) + + 1 sin ^l) > • 1 (3.71) 

\B\ \E\ ' 

Let's denote the ratio of magnitudes of electric and magnetic field by A: 

— * 

A = (3.72) 

\B\ 

Thus, our inequality becomes 

cos(2^) + (A + A"^) I sin ^1 >• 1 (3.73) 

We notice that 

A(A + A-^) = l-A-2 (3.74) 
aX 

which means that it is decreasing when A < 1, increasing when A > 1 and 
reaches minimum when A = 1. That minimum is 2. Thus 

A + A-^>2 (3.75) 
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This means that if cos(2^) + 2| sin^| > 1, it would automatically imply that 
cos(2^) + (A+A-i)|sin^| > 1 for all A. On the other hand, if cos(2^)+2| sin ^| < 
1, then the vicinity of A = 1 will serve as counter-examples for the above 
statement. Thus, instead of checking the original inequality, we can simply 
check the following: 



cos(2^) + 2|sin^| >• 1 



(3.76) 



Let 



f{9) = cos(2^) + 2sin^ 



(3.77) 



Thus, what we need to check is 



< ^ < TT f{e) > 1 



(3.78) 



It can be easilly seen that derivative of / is given by 



f(e) = 2(cos e - sm{2e)) = 2 cos 6(1 - 2 sin 6) 



(3.79) 



Thus, the maxima or minima of / occur where 



cos ^ = or sin 6 — - 
2 



(3.80) 



This means that they occur at 




5 3 

6 ' 2 



■TT 



(3.81) 



The values of / at these points are 



/(^l) = /(^3) = I , f{02) = 1 , /(^4) 



= -3 



(3.82) 
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The fact that /(^4) < 1 does not bother us because $4 > tt and therefore is 
outside of the range of the angles we are interested in. 

The above values tell us that local maxima occur at 9i and 6*3, while local 
minima occur at $2 and 9^. However, since we are restricting our range to 
the interval between and tt, these two values also serve as potential minima. 
This means that 



< < TT ^ f{9) > mm(/(0), f{9^), /(tt)) 



(3.83) 



Now, /(O) and /(tt) are given by 



/(O) = /(tt) = 1 



(3.84) 



This implies that 



0<9<n^ f{9) > 1 



(3.85) 



as desired. 



Using the earher estabhshed fact that 



A-^ + A > 2 



(3.86) 



this implies that 



cos(2^) + (A + A-^)|sin^| > 1 



(3.87) 



which, as shown earlier, is equivalent to 



V2 > -1 



(3.88) 



Since, as shown earlier. 



V2<0 



(3.89) 
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this implies that V2 is a physical, thus confirming the existence of frame where 
E and B are parallel. 

Now, in the above work it was assumed that the Lorentz boost is performed 
in X direction. This means that, by itself, this does not prove the uniqueness 
of the frame where electric and magnetic fields are parallel. In fact, if we start 
out from the frame in which they are parallel, then performing a Lorentz boost 
in the direction in which they both point, their values will stay unchanged, 
which means that they will remain parallel. 

However, they will not stay parallel if Lorentz boost is pefromed in any 

— * — * 

other direction. Suppose the mutual direction of E and B is z axis. By 
rotational symmetry, we can say that the direction of Lorentz boost lies in xz- 
plane. This boost can be done by first performing a boost in the z direction 
and then in the x direction. The boost in the z direction will leave both fields 
unchanged, while the boost in the x direction will produce 

E' = (0, --fvB, -fE) (3.90) 

and 

B' = {Q,-fvE,-fB) . (3.91) 

If the above vectors were parallel, then y components would imply E = —B 
while z components would imply E — B, which would imply E — B — in 
which case the whole notion of them being or not being parallel is silly. 

Thus, in the frame where they are parallel, any other such frame is pro- 
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duced by boosts in their mutual direction, and in these other frames their 
values are identical. This means that in the case where everything is flat, 
smooth and linear, the answer to the Lagrangian generators given earlier is 
independent of which of the frames in the above class is chosen. 

In the real situation, the frame will be chosen in a way that would minimize 
the random fluctuations due to discreteness and non-linearity. By throwing 
away these effects, Lagrangian generators will return something very close 
to the values of electric and magnetic flelds in the case of sufficiently dense 
Poisson distribution on a manifold. 

By going back to the original Lagrangian generators, we see that we have 
two separate ones: one for the electric field, and the other for the magnetic 
field. On the first glance, this might appear bothersome since neither is Lorentz 
covariant. However, by realizing that the values these fields are taken from 
specific reference frame (i.e. where they are parallel to each other) implies 
that they are replaced by covariant expressions in an arbitrary frame. This is 
similar to d^(j) taken from the reference frame where i-axis is parallel to the 
gradient of (f) being replaced with sJW^^d^ in an arbitrary frame. 

Let's now show explicitly what the covariant expressions are. In the refer- 
ence frame in which E is parallel to S, we can treat them as scalars and have 



70 



the following system of equations 



E^-B^ ^ F^""" F^^ (3.92) 
EB^^e^fS^sF^'^F^' . (3.93) 



This amounts to a system of two equations and two unknowns, which means 
that its solutions for E'^ and B^ are covariant expressions for each of these 
terms separately. This should not surprise us. After all, when E and B were 
treated as scalars, it was done in a special frame where E and B are parallel. 
Thus, the procedure of first finding such frame and then evaluating E and B 
in that frame is covariant, despite the fact that E and B in an arbitrary frame 
are not. 

Now, to satisfy ourselves, let us solve that system of equations to get an ex- 
pression for E and B. The second equation implies that B = eap-ys F°'f^ F'^^ /kE. 
Substituting this into the first equation, we obtain 

F'^^ F,^ ^E'- ^'"^"'^r^^2^''^' ■ (3-94) 
Multiplying it by E'^, and moving all terms to the left-hand side, we get the 



following equation 



E^ - E' F^^ F,^ - ^'"^"'^r^^2^''^' = • (3-95) 



This solves to 
2 



F^'' F,, + ^ (FM- F,,Y + ^ {e^p,s F-P F^^f 



(3.96) 
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Thus, first equation of the system of two equations imphes 





(3.97) 



The Lagrangian generators for these are Lt and Lg respectively, defined earher. 
Thus, we see that indeed they are both Lorentz covariant; they simply include 
a Lorentz contraction that does not exist in practice. This can be easily fixed 
by setting 



3.5 Getting Rid of Unwanted Fluctuations 

There is a side-benefit of the Lagrangian generator approach that is worth 
mentioning: it automatically adresses the issue of unwanted fiuctuations. 

In the last two sections we have shown that if we have a Poisson distribu- 
tion of points on a Lorentzian manifold, then the Lagrangian density derived 
from Lagrangian generator on that scattering will approximate continuum- 
based Lagrangian density defined in a conventional way. However, we have 
implicitly made the following assumptions: 

1) The Alexandrov set that is being selected in a prescribed manner is not 
too small. In particular, it has enough points to make stochastic fiuctuations 
nearly 0, which would make my arguments reliable. 

2) The above Alexandrov set should not be too large either. In particular, 
in the interior of that Alexandrov set both the curvature of spacetime as well 



(3.98) 
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as all fields are assumed to be linear. 

The first condition is taken care of explicitly when the constraint t{p, q) — 
To is imposed in the Ck = niin{/Cmax} part of my definition, as long as r 
is assumed to be large enough for most of such Alexandrov sets to contain 
sufficiently many points to be statistically reliable. 

The second constraint can not be imposed as naively as the first constraint 
can. In a near light cone region the Lorentzian distance between two points 
can be arbitrary small, while the coordinate differences can be arbitrary large. 
Due to large coordinate differences, we can not assume linearity in the interior 
of Alexandrov set, despite the small Lorentzian distance. 

As a result of Lorentzian covariance, the above statement means that 
smallness of Lorentzian distance does not imply linearity, period. This state- 
ment is true regardless of whether coordinate difference small or not. This 
can be illustrated by using a specific example of small Lorentzian distance and 
large coordinate one. For example, we can consider an electron flying from 
lab A to lab B with near lightlike velocity. The Lorentzian distance between 
events of emission and absorbtion of electron is small, while coordinate one is 
large. 

We now make a Lorentz transformation to the reference frame of the 
electron. In this frame, the coordinate difference between the two events is 
also small. What the electron will see is labs A and B flying very close to each 
other with near lightlike velocity. Both labs fit inside of a very small region 
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since both underwent Lorentzian contraction. As a result of this, fields change 
very fast in space. And due to the fact that this spatial picture moves with 
near lightlike velocity, they change in time as well at any fixed space location, 
in particular in the fixed location occupied by an electron. 

Thus, what we see is that, on the one hand, it is not true that fields 
are locally linear in an arbitrary chosen reference frame. On the other hand, 
however, they are in fact locally linear in the reference frame in which we live 
in. This seems to suggest that there is ether. This, however, would contradict 
letter and spirit of causal set theory, since one of its goals is to maintain 
manifest Lorentz invariance. So, instead of saying that there is an ether, we 
would like to say that the way in which fields interact with each other makes 
them "slow down" with respect to each other to the extent that their "center 
of mass" would have a well defined reference frame (which would not have 
happened otherwise due to the non-comactness of the Lorentz group) . 

This situation is something we already are used to in standard physics. For 
example, if we consider earth physics, there will also be a "preferred frame" , 
namely the frame of the Earth. But this does not raise a concern of viola- 
tion of Lorentz covariance because the process of Earth formation was caused 
by interactions that are defined in Lorentz covariant terms. Thus, in order 
to "pick a preferred frame without violating relativity" we have to define a 
relativistically covariant physics that would do the work for us. 

This is exactly what the theory of Lagrangian generators is doing. On 
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the one hand, the criteria of selecting of Alexandrov set with minimal A>C 
is relativistically covariant. On the other hand, however, the Alexandrov set 
selected by that criteria will imply a "preferred frame" (namely, the one in 
which the two end points of Alexandrov set have the same space coordinates). 
This "preferred frame" is determined in light of the behavior of the fields 
as opposed to being imposed from outside. Thus, this is analogous to the 
situation with the earth rather than the situation with the ether. 
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4 Chapter 4: Type-2 Bosonic Fields 



4.1 Failure to View Gravity as a Type-1 Field 

Before proceeding to introduce type-2 bosonic fields, it is vital to understand 
the prime example of a bosonic field that can not be viewed as type 1: a 
gravitational field. In particular, it is important to show that if type-1 gravity 
were to exist, then yes we would still be able to write down its Lagrangian in 
covariant form; the only problem will be the presence of unwanted terms such 
as R^^R'^. 

This is crucial because if the failure to produce Lorentz covariant contrac- 
tion was the problem it would imply a hole in a theory, as its major premise 
is that, by making sure that only causal relations are used to define geometry, 
Lorentz covariance is maintained. 

On the other hand, by producing covariant terms that are not observed 
in nature I make a point that the logic of the type-1 theory is perfectly self 
consistent and relativistically covariant; it is simply that experiments (in par- 
ticular the ones involving gravity) have shown us that there is something more. 
That "something more" is type-2 fields, and its presence does not negate the 
possibihty of other fields still being type 1. 
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Consider Lagrangian generator defined as follows: 



V{a{p, q)) if p ^ g and r(p, q) = tq 



(4.1) 







otherwise . 



V 



Here, tq is the same number that is used as a constraint on the selection of 
Alexandrov set in the procedure of going from Lagrangian generator to the 
Lagrangian. This means that, for any Alexandrov set that is allowed to be 
selected, exactly one pair of points will have non-zero value of gravitational 
Lagrangian generator, namely the end points of that Alexandrov set. This 
means that variation of Lagrangian generator is given by 



As is shown in Ref [7] , the volume of Alexandrov set defined by arbitrary r ~< s 
is given by 



Vol{a{a,b)) = T\a,b){kd+{ARg^^ + BR^^){bf'-a''){b''-a'')) + 0{r^-^^) (4.3) 



A/C = \kdT^ + ARt^' + Br^R^^ib^ - a^W " o^l + 0(ro"+=^)| (4.4) 



In light of the fact that the Ricci tensor has 10 degrees of freedom, while 
Lorentz group has 6 (3 rotations and 3 boosts), 10-6=4 implies that it is 
possible to choose a coordinate system in such a way that Ricci tensor is 
diagonal: 



A/C = Vol{a{p,q)) 



(4.2) 



By substituting p and q into the above. 



R, 



(4.5) 
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In this coordinate system, the variation of the Lagrangian generator becomes 

A/C = \k,r^ + ARt^+' + B4 P^i^ - ""'f + I ■ (4-6) 

We will first evaluate the Lagrangian density in that particular coordinate 
system, and then generalize it to other ones. We will do that by cases. 

For arbitrary p and q satisfying t(p, q) = Tq, 

p,{W - a^f = po{b' - + J2 Pk{^' - = (4-7) 

k 

k k k 

If for all /c, pk > po, then the Alexandrov set that minimizes A/C is the one 
whose axis parallel to t axis, — — tS'^. This means that the latter 
Alexandrov set will be used to define the Lagrangian density, which gives 

yk{pk >po)^C^ poT^ (4.8) 

Now suppose there is at least one i for which pi < po- Then, regardless of the 
values of pj for j 7^ i, if we select 



go = Po + \f'o + —^^ > qi^Pi+ \ . ^" > , qj ^Pj , j (4.9) 

V Po — Pi V Ao — A3 

we would get 6JC = 0. This means that either the above Alexandrov set, or 
some other one with SK. = is used to define the Lagrangian. In either case, 
this implies C = 0. Thus, 

3i(Xi < Ao) =^ £ = (4.10) 
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This can be summarized as 



(4.11) 
otherwise . 



In the above expressions, the components of the Ricci tensor selected in the 
special frame where it is diagonal were wisely replaced with pj. The latter are 
simply eigenvalues of R. Since the notion of eigenvalue is covariant, the above 
expression is covariant as well. This means that we can remember the above 
result, while dropping the assumption of a specifically chosen reference frame. 

Lets satisfy ourselves and actually find out the covariant expressions for 
eigenvalues. The four eigenvalues of Ricci tensor are the solutions of the equa- 
tion 

detiXg^, - R^,) = . (4.12) 

Here, we are using A-s instead of p-s, because we are not sure in what "order" 
to write down the eigenvalues once we found them. Thus, A-s will be some 
permutation of p-s. 

The above expression can be rewritten as 

= det(A5f^^ - R^^) 

— e°'^'^^e'^''^"{\ga^ — Ran){,^g(}v — Rpi^){^gfp — Rfp){^gsa — Rsa) 

— X^e'^^'^^e^p^s Rati + RatiRpu 

— 4 \e°'^'^^ e^"^ s Rati R^v R-yp + ^"'^'^^ ^^'^'"^ RatiRpi^RypRsa ■ (4-13) 

We would now hke to compute the contractions of the e tensor. First, note 
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that 



-e^^^'e^.,, (4.14) 

= K - K K - ^2 K + K + € K K - '^M 

- 5: 5^ 51 6i + 5: 5^ 51 5l + 5: 5l 51 5i - 5Z 5^^ 51 5^ - 5^ 5^ 51 5^ + 5^ 5^ 5^ 5^ 

+ 5; 5l 51 5i - 5; 5^ 51 5i - 5^; 5^ 51 5i + 5^ 5^ 51 5^ + 5^ 5^ 51 5t - 5'; 5^ 5^ 5^ 

- 5^ 51 51 5j + 5Z 51 51 5l + 5^ 5^ 5; 5^ - 5^ 5^ 5^ 5^ - 5^ 5^ 5; 5^ + 5^ 5^ 52 5^ . 

Setting 5 = a imphes 

= S'^s^s^ - 5'^s^s^ - 5'^5^5"' + 5'^ 5^ 5"' + 5'^5'^5^ - 5'^5^5^ . (4.15) 
Now setting j — p imphes 

- e'^^-^'e.^.s = J2 ^^'^X - ) = 2(5^5f - 5X) ■ (4-16) 
Setting (5 — V — \ imphes 

Finahy, setting a — /i — r} imphes 

-e'^^^W = 6^<^^ = 24. (4.18) 

all p. 

Substitution these into the expression for the determinant imphes 

= -24A^ + 24A'^i? + 12\\R'^R^ - R^) + 4X{3RR'^R^ - 2R^R^Rl - i?'^) 

-i?^ + GR^R'^R^^ + GR'^^R^RjRi - 8RR"^R'^Rl - 2(i?^i?f )2 . (4.19) 
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Thus, the four solutions to this equation are 

Ai,2,3,4 = /i,2,3,4(-24, 24, l2{RIRi - R')A{3RR''pR^ - 2R''pR^^Rl - R'), 
- i?^ + 6R^R'^Ri + 6R''^R^R]Ri - SRR'^R^R^ - 2(i?^i?f )') (4.20) 
where /i,2,3,4(a, b, c, d, e) represent the four solutions to the equation 

ax'^ + bx^ + cx^ + dx + e — 0. 

We would now like to be able to single out an eigenvalue corresponding to the 
time-like eigenvector. Noticing that 

d 

Roo = Xi^R = 2Xi (4-21) 

i=i 

Rqq can be produced in the following covariant form: 

d 

Roo = lim y A, e-"(2^'-^'=i . (4.22) 

n— ♦oo ' ^ 

i=l 

In the case of a Poisson distribution of points on a manifold, the above equation 
is not true since it relies on the exact equality to 1, and once things are 
discrete nothing is exact. Thus, instead the limit can be replaced with the 
same expression where n is assumed to be large, but finite, constant: 

Roo ^ X] A, e-"(2^-^.-i . (4.23) 

i=l 

Substituting this into the earlier defined expression for the Lagrangian in terms 
of eigenvalues. 



if ^{k\Xk <-e + e-"(2Ai-E,ti ^,-R?y > q . 

(4.24) 
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In the above expression, 

^{k\Xk > e + e-^-'^i-^U ^.-«)'} = d - 1 (4.25) 

means that the number of points that are "visibly" (i.e., by more than e) 
greater than pQ is d — 1. If n and e are adjusted in such a way that the 
expression for estimation of po can not be visibly larger than pQ itself, it implies 
that every single is visibly larger than po- 

On the other hand, 

tl{A;|Afc < -e + e-^^^^^-^.-i^^"^)'} > (4.26) 

implies that there is at least one A that is visibly smaller than the expression 
for pQ. Again, assuming that e and n are adjusted in such a way that po can 
not be visibly smaller than the equation that approximates it, this implies that 
there is at least one pi that is smaller than po. 

The above makes it obvious why the C was expressed in the form that it 
was. Now, remembering that, once the quartic equation is solved, Aj them- 
selves are expressed in Lorentz covariant form, this implies that the expression 
for Lagrangian density is covariant as well. Its only problem is the presence of 
unwanted contractions of the Ricci tensor with itself. But these contractions 
are still covariant; they were just never observed in nature. 
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4.2 Difficulties with Non- Gravitational Type-1 Fields 

In order to motivate the introduction of type-2 version of non-gravitational 
fields, let's briefly outline other difficulties that type-1 theory encounters. One 
such difficulty was mentioned to me by my advisor, Luca Bombelli. It is related 
to introducing SU(n) flelds. While it might still be possible to do better in the 
future work, the current type-1 definition of Lagrangian generator relies on a 
choice of coordinate system in which electric and magnetic fields are parallel. 
Existence of that coordinate system was guaranteed by the comparison of 6 
degrees of freedom of F^^ and 6 degrees of freedom of Lorentz group, which 
would not work for SU(n) which has far more than 6 degrees of freedom. 

There is a way around the issue, if one is willing to sacrifice the fundamen- 
tal SU(n) covariance. In particular, one can view A^, A^, etc. as completely 
different fields which, independently from each other, have identical type-1 La- 
grangian generators. As a result, they have identical Lagrangians, and their 
sum just happens to APPEAR to possess SU(n) symmetry. But this is a 
result, rather than a cause, of specific Lagrangians. In other words, the sym- 
metry is entirely coincidental. Furthermore, as a consequence of non-linearity, 
the symmetry is no longer exact. After all, the Alexandrov sets that used to 
minimize variation of each individual A"- might be completely different from 
the ones used to minimize their linear combinations. 

Up to the order of some approximation, however, this would still reproduce 
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predictions of quantum field tfieory, since the Lagrangians match in hnear case, 
whether we call it symmetry or not. So this can be viewed as an ideological 
sacrifice as we are no longer calling it electroweak theory, but rather it is 
separate electric and weak fields. This sacrifice might still be worth it if type- 
1 theory proves to be the only one producing real results in light of type-2 
theory being more complicated. But still one has to be aware of what is being 
sacrificed. 

There is also another difficulty, which has to do with type-1 fields in 
general, not only SU(n) ones. This has to do with the stochastic nature 
of assigning numerical values of fields to points or pairs of points of a causal 
set. For example, suppose the values of fields are distributted on a bell curve 
of width 1/1000 around 1. Despite the fact that the width of bell curve is 
extremely small, statistics tell us that, as long as there are enough points (or 
enough pairs of points), there is 99 percent chance that some values of fields 
will be far from 1, say, they might they might be greater than 10 or less than 
—10. This would have been okay if all values of fields were averaged out. But 
in light of the maximization procedure, only these "obscure" field values would 
ever get a chance to be looked at. 

There is a way of adressing this. Namely, if we live in a world full of type-1 
fields, we would never have a chance to learn that any of the "more common" 
values occur. So, no matter how far apart the "obscure" values are spaced, 
according to us they are very close to each other, since we never observe any 
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variation that is smaller than that. Furthermore, even if in reality they are 
very rare, to us they would be the most common ones, if not the only ones 
that ever exist. 

However, whether such an answer is satisfactory or not is ultimately up to 
the numeric tests since, in light of the fact that Alexandrov sets overlap, it is 
possible that less obscure values of fields will figure out somewhere, allowing 
us to "catch ourselves" . So, until such numeric work is done, it is best to hope 
for the best and, at the same time, have ammunition for the worst. In other 
words, it is best to retain type-1 theory in case it works, and also introduce 
type-2 theory in case it doesn't. 

4.3 Type-2 Lagrangian Generators 

In light of the above difficulties we will introduce an alternative model of 
Lagrangian generators, type-2 one. The key idea to the model of type-2 fields 
is that, as will be shown in the next three sections, in the linear case. 




(4.27) 





(4.28) 
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+ / Via{r,q))), (4.29) 

Jv{a{p,r))>V(a{p,q)) ^ 

for appropriately adjusted k and E. 

This means that it has the following advantages and disadvantages: 
ADVANTAGES: 

1) Lagrangian can be estimated based on arbitrary Alexandrov set rather 
than specifically selected one 

2) Gravitational Lagrangian is reproduced 
DISADVANTAGE: 

1) The coefficients E have to be appropriately adjusted. 

Adressing the above-mentioned disadvantage is an important issue that 
needs to be adressed, is adjusting the coefficients. In hght of the fact that the 
theory is aimed at arbitrary causal set, not just manifold-like, if E is viewed as 
a constant, this would imply that it was an extremely lucky coincidence that 
it happened to lead to exact cancelation in a special case of a four-dimensional 
manifold. This, of course, is not satisfactory. 

For that reason, it is viewed as variable, subject to some physical laws 
that point-wise adjust it to the values that it should have at any given point. 
In other words, if field of interest is F e ^, then 

E = E(F, x) (4.30) 

is a function of both point x as well as the value of fields F e ^ in its 
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surrounding. Since E is not a separate field, the path integral does not include 
integration over E. Rather, it is given as 



Definition of E for a general causal set requires a notion of the degree of 
relativistic non-covariance. E is selected in such a way that minimizes that 
non-covariance. If reference frame is identified with the axis of Alexandrov 
set, the degree of non-covariance is how much the answer changes depending 
on the choice of Alexandrov set. 

In case of linear fields in fiat Minkowski space, non-covariance is zero as 
long as E is appropriately adjusted. In case of discritized curved space, as a 
consequence of discretization it is not possible to select infinitesimal region, 
which means that some small non-covariance will appear as a result of specific 
behavior of a curvature, but as long as the fields are well behaved differentiable 
functions it should be very small. 

However, in case of general causal set, most of the assumptions that are 
made for the manifold can no longer be trusted, which means that it is possible 
that degree of non-covariance is large no matter what E is selected to be. 
However, it is still possible to FORMALLY select E in such a way that it 
would minimize the degree of non-covariance, even though the minimum would 
still be very large. 

In light of the fact that in the near hghtcone region of point p fields vary 




(4.31) 
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uncontrollably, the value of E might have nothing to do with the one it would 
have had in linear case if all possible Alexandrov sets {a{p,q), T{p,q) = Tq) 
were considered. For that reason, the varying Alexandrov set is constrainted 
to vary within the boundaries of some other, larger, Alexandrov set a{P,Q). 
These smaller and larger Alexandrov sets satisfy the constraint 



For any point a; in a causal set, ari,T2 {^) is defined a set of triples {E, P, Q) for 
which VarT^{C, F, E, a{P, Q)) is minimized with a constraint t(P, Q) — T2: 



c^r^^A^) = {{E, P, Q)|r(P, g) = r2 A V(£;', P', Q'){t{P', Q') = T2 ^ 

^ Varr,{C,F,E,a{P',Q')) > Varr,{C, F, E,a{P,Q)))} (4.34) 



Typically, arj^^r2,F{x) is a one-element set, and Lagrangian at x is simply given 
by jC{F, E, P, Q) where {E, P, Q) is a unique element of that set. But in order 
to formally accommodate extremely rare cases where ar^^r2,F{x) has more than 
one element, the Lagrangian at x is formally defined as an averaging of the 
above over all elements of cKt-j ,7^,^(0;) : 



Ti = r(p, q) < t{P, Q) = T2 



(4.32) 



The variation, then, is defined 



Varr,{C,F,E,a{P,Q)) 



(4.33) 



■max{C{E, a{p, g))|P -< p ^ g -< Q A r(p, q) = Ti} 




(4.35) 
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Now let's switch gears and go back to the expression for C{E, a{p, q)), which, 
from now on, will be referred to as pre-Lagrangian. As the examples in the 
beginning of this section illustrate, the expressions for JC{E, a{p, q)) look sim- 
ilar for scalar, gauge, and gravitational fields in a sense that they all look like 
a linear combination of a function of p and q and some form of integral of the 
same function over the interior of a{p, q). So the natural question arises: why 
do Lagrangians take this particular form and not any other? 

That question is answered by introducing a concept of type-2 Lagrangian 
generator and formally defining a procedure of going from type-2 Lagrangian 
generator to pre-Lagrangian in such a way that the above mentioned linear 
combination arises in a natural way if one formally follows the steps of the 
procedure. 

Suppose J: 5"^ ^ R is some real valued function, and suppose f,g:S^^ 
S"^ are defined as follows: 




(4.36) 



Then 





(4.37) 



and 




(4.38) 
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Thus, the pre-Lagrangian is expressed as 

C,^FM^^^+^^J,'rJ^p,r) (4.39) 
can be rewritten as 

^ ^I^^r^ I (^(f(P^ ^' ^)) + EJigip, q, r))) . (4.40) 

\Pi Q) Ja{p,q) 

In order to preserve time-reversal symmetry, we would like to replace Jl{f{p, q, r) 
and J {g{p, q, r) with J {f(p, q, r))+J{f{q,p, r)) and J {gip, q, r))+J{g{q,p, r)) 
respecitively. Thus, pre-Lagrangian is given by 

Cj{F,E,p,q)^ I d''r(j{FJ{p,q,r)) + JiFJ{q,p,r))+ 

\P-i q) Ja{p,q) ^ 

+ E{J{F, g{p, g, r)) + J{F, g{q,p, r)))) (4.41) 

Since t(p, q) is constant, it can be thrown away. Furthermore, in order to 
allow for the integrals such as 

/d^.d^.d^.(a(,.. (4.42, 

in electrodynamics, /(p, q, r) and g{p, q, r) should be generalized to /(p, q, ri, r„) 
and g{p,q,ri, ...,r„). 

This leads to the final definition of type-2 Lagrangian generator and cor- 
responding pre-Lagrangian. For the sake of completeness, and convenience to 
the reader, the following definition will formally include both the transition 
from Lagrangian generator J' to pre-Lagrangian C{F, E,p, q) just discussed 
as well as the transition from the above pre-Lagrangian to actual point-wise 
Lagrangian C{F, x) discussed earlier in this section: 
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DEFINITION: Let F be a set of the possible distributions of a field of 
interest. A type-2 Lagrangian generator is a triple {J' : J-' x ^ M., f : —>■ 
S'^,g: S'^ — > S'^). The pre-Lagrangian corresponding to the above type-2 
Lagrangian generator is Cj: x W x S"' —>■ M., given by 

Cj{F,E,p,q)^ J2 (j{FJ{p,q,r,,...,rn))+J{FJ{q,p,n,...,r^))+ 

p<ri<q 

+ E{J{F, g{p, q, n, r^) + J{F, g{q,p, n, r^))) (4.43) 

The "variation" of this pre-Lagrangian is a function Far j-^^^ : J^xRx{{P,Q)\P -< 
(5} — > IR given by 

Varr,{J,F,E,a{P,Q))^ (4.44) 
= max{C{E, a{p, q))\P -< p -< q -< Q A t{p, q) = ri} 
For any x E S, the (F, ri, r2)-based neighborhood of x is given by 

«n,^,F(x) = {{E, P, Q)\t{P, Q)^T2A W{E', P', Q'){t{P', Q') = r2 ^ 

^ Var,,{C, F, E, a{P', Q')) > Var^^iC F, E, a(P, Q)))} (4.45) 
Finally, the pointwise Lagrangian density corresponding to is given by 

jCj,r.,r,{F,x) = - Yl ^{F,E,P,Q) (4.46) 

4.4 Type-2 Scalar Fields 

In the rest of the chapter, the machinery of type-2 Lagrangian generators will 
be used to predict Lagrangians for particular fields. While it was shown that 
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gravity if viewed as type-1 field leads to erroneous results, the converse is not 
true: the fields that were successfully viewed as type-1 fields, such as scalar 
field, can be viewed as type-2 fields with equal success. 

Of course, in reality scalar field, just like other non-gravitational fields, 
are either type 1 or type 2; they can't be both at the same time. However, 
there are only two kinds of experimental tests between these theories: 

1) Higher order terms created by curvature are predicted to differ between 
type-1 and type-2 theories. 

2) A the type-2 theory predicts that masses, charges and gravitational 
constant differ from dimension to dimension, while a type-1 theory does not 
predict that. 

Since neither kind of experiment can be carried out in the near future, 
both kinds of theories should be studied for non-gravitational fields. In this 
section we will study typc-2 scalar field. Then, in the following two sections 
we will do type-2 gauge and gravitational fields. 

The Lagrangian generator for scalar field is given by (J", /, g) where 

r, s) = (0(r) - - ^(mV'(r)) (4.47) 

fin, r2, rg, Ti) = (ri, ra) , g(ri, ra, rg, ri) = (rg, r4) (4.48) 

For the reasons that will soon become apparent, m is not the actual mass, 
although it is related to it. In fact, m is assumed to be very small, of the order 
of T. The above expression implies that the pre-Lagrangian for scalar field is 
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given by 

C{<f>,E,p,q)^ J d''rd''s{J{f{p,q,r,s)) + EJ{g{p,q,r,s)))^ (4.49) 

= j d''rd''s{J{(t>,p, q)+EJ{(t>T,s))^J{(t>,p, q)V\a{p,q))+E j d'^rd'^sJicj^^r, s) 

Here, the factors of 2 appear as a result of permuttingp and q. By remembering 
that the volume of n dimensional ball is 

we obtain 



where 

27r('^-i)/2 

" (d-i)r(2'^-2d(d-i)) ^^■^'^^ 

for some constant /c^ that will be computted later. Substituting the above 
expression for volume, along the expression for J' in the above equation for C 
we obtain 

£(0, E,p, q) = ky'ip, q) - - + 

+ £;y" d'^rd'^s((<^(r) - (P{s) f - m^(p'^{r)) (4.53) 

We would now like to compute Lagrangian density, if the space time is assumed 
to be flat Minkowskian and (p is assumed to be linear. 

Lets start from the mass term. It is given as 

£„(0, E,p, q) = ^r'^ip, q){<t>\p) + <t>\q)) + Em' j d''rd''s(l>'{r) (4.54) 
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The above expression tells us that the leading order of the mass term is mr^". 
When we will get to the kinetic term, it will be shown that the order of 
magnitude that we are interested in is r^'^+^. Thus, if m is assumed to be of 
the order of r, then the leading order is r^'^"'"^ which coincides with a leading 
order for the kinetic term. 

This means that, as far as mass term is concerned, we can throw away all 
the higher order terms, which can be done by using an approximation 0o 
which tells us 

E,p) ft. m'ky\p, q)ct>l + Em'ct>lV\a{p, q)) = 

^{1 + E)m^kl4>y\p,q) (4.55) 
Now let's look at the kinetic term. The linearity assumptions imply that 

(0(r) - (l){s)f = (r^ - s^)(r'^ - (4.56) 

Consider a coordinate system in which t axis passes through p and q, while 
origin lies in the midle between these points. Denoting r(p, q) by r, 

p = (-r/2, 0, 0, 0) , ? = (t/2, 0, 0, 0) (4.57) 

In this coordinate system, 

f d'^rr^' = (4.58) 

J a(p,q) 

since the above integrand is antisymmetric with respect to the center of Alexan- 
drov set. By slicing Alexandrov set on balls t — const, we get 
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where 

" {d - l)r(2<^-i(d - l)d{d +l){d + 2)) ^^'^^^ 
Furthermore, it can be shown that 

f {x^fd'^r = /diT''+' (4.61) 

J a(p,q) 

where, by cyhndrical symmetry, the coefficient is the same for each k and is 
given as 



27ri-i 



(d-2)r(i-i) 

Substituting these expressions into the integral we obtain 



(4.62) 



d-l „ d-l 



I d'^rd'^s{(t){r) - (l){s)f = ( / dh^ ^rf'^r(r'^)' + ( /" d'^r) Y^d'^s{s>'f = 

- 2A;rfr2''+2((/,o + hi{d - l))(5o0)2 - /di(ci - l)a^0a^„0) (4.63) 



Thus, the kinetic term of the pre-Lagrangian is 

Am(0,^,P,g)= (4.64) 

= A;rfr2'^+'((ao0)2(A;, + 2£;,(/,o + /di(c^ - 1))) - 2Eahi{d - l)d^^cl>d^cl>) 

Switching from the coordinate system in which t axis passes through p and q 
to the arbitrary one, the result becomes 

- 2kaEahiT^''^\d - l)a>a^0 (4.65) 

If the choice of points p and q varies with constraints that both the mid-point 
between p and q as well as the Lorentzian distance between the two points 
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are fixed, the E,p,q) undergoes the variation of the order of r^'^"'"^ due 
to (g'* — p'^){(f — p'')d^phidy4> term. Mass term, on the other hand, only 
gives variations to higher orders. Thus, if the variations of the orders higher 
than r^*^"*"^ are neglected, then the variation can be "minimized", or in this 
particular case, set to 0, if 

kd + 2Ed{ho + hi{d - 1)) = , (4.66) 
which determines the value of Ed- 

^ -2iIdo + llid-l)) ■ ^'-''^ 
Substituting this into the expression for Lagrangian gives 

^dO + ^dl[(l — i-) 



k 



d 



2{Ido + Idi{d-l)) 
This Lagrangian can be rewritten as 



C = hf-^dydd,<l>d - , (4.69) 



where Vq is volume taken up by one point and 



k A d+i / Id\{d- 1) 

^^0 \ Ida + Idi{d-l) 



rud = -kdT\l - ^.^ J', (4.71) 

In future sections other fields will be similarly scaled, but the coefficients will 
be different from field to field. At first this might seem wrong since the kinetic 
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terms of all Lagrangians have the same coefficient 1 in standard quantum field 
theory. But it can be easilly shown that this difference does not amount to 
anything but the change of overall factor: 

J eMiS{(l>i) + ... + iSM) = (4.72) 

= p\d--pnd J [D(f)id...V(f)nd] exp{iS{(f)id/p) + ... + iS{(f)nd/p)) 

where 

(pkd = Pkd(pk (4.73) 



4.5 Charged Type-2 Scalar Field 

Consider a charged spin-0 particle, described by a set of complex scalar fields 
(f)— (01, 0n) coupled to a SU(n) gauge field. (Notice that in this approach to 
the dynamics of matter fields in causal set theory, although it will be assumed 
that spacetime is discretized, the internal degrees of freedom will still have a 
continuous invariance group.) The dynamics of such a field can be described 
in the continuum starting with the matter Lagrangian density 

C^g^^u, 0, A^; x) = \ \g\"^ [g^^ {D^(t>)^ {D,(t>) - 0^0] , (4.74) 

where the gauge covariant derivative is defined as usual by Dfj^cj)'^ :— d^cj)'^ + 
ieyl^^b^'', with = Aj^T^ the Lie-algebra-valued connection form repre- 
senting the gauge field on a differentiable manifold. (Here, Latin indices a, 6, 

are Lie-algebra tensor indices, while A;, /, label elements of the basis 
of the Lie algebra.) In the causal set context, the scalar field will be simply 
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replaced by a corresponding field defined at each causal set element, but to 
write down the action, it is important to specify what variables will replace 
A,. 

As done in the type-1 approach, gauge field is defined in terms of holonomies, 
where by holonomy refers to the group transformation corresponding to the 
parallel transport of a Lie-algebra-valued field such as between two points 
p and q. In a differentiable Lorentzian manifold M (of dimension d), holon- 
omy is defined as the function a : M x M ^ TSU(n), where T stands for 
tangent bundle, which means that TSU(n), being a tangent bundle to SU(n), 
consists of all n X n tensors with trace 0. This map assigns to any two elements 
p, g e M the holonomy of along the geodesic segment j{p, q) connecting p 
and q in M, given by 



in terms of which the expression D^cplx) appearing in the scalar field La- 
grangian arises from the leading-order term in the expansion of the expression 



This means that the causal set version of the charged scalar field La- 
grangian can be obtained by making some simple substitutions in the one 
obtained in the previous section for the Klein-Gordon field. Thus, if gauge 
field a is assumed to be fixed and not subject to any Lagrangians, then the 
type-2 Lagrangian generator for a matter field e R" interacting with a is 




(4.75) 
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given by {J , f, g) where 

JsU<t>.<^.r,s) = |0(r)-a(r,s)0(s)|^ + — (0*(r) + 0*(s))((/)(r) + 0(s)) (4.76) 
and 

fsin, r2, ra) = (ri, rg) , ^^(ri, ra, rg) = (ri, rs) (4.77) 

However, as discussed in the next section, a, itself, is subject to type-2 La- 
grangian generator given by 

JvAiia, ri, r2, ra) = tr[(a(ri, r2) + a(r2, rs) + a(r3, ri))^] 

fYM{ri,r2,r3,u,r5) = (r3,r4,r5) , 5'yM(ri, r2, rs, r4, rs) = (ri,r2,r3) . (4.78) 

In order for the theory to possess SU(n) symmetry in the setup, these two 
Lagrangian generators are combined into one as {Jtot, f, g), 

Jtot{a, ri, r2, rs) = Jscai{(p, a, ri, + JYM{a, ri, r2, rs) 

ftotin, r2, rs, r4, rs) = fruin, r2, rs, r4, r^) (4.79) 
9tot{ri, r2, rs, r4, rs) = gyniri, r2, rs, r4, rs) 

4.6 Type-2 Yang-Mills Field 

In this section, the main goal is to express the Yang-Mills Lagrangian density, 

CYM{gf.u,A^;x) = \ |<7r/'tr(F^,F^'^) , (4.80) 

in terms of the holonomy variables for the gauge field introduced in the previ- 
ous section, as well as variables describing the geometry that are meaningful in 
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the causal set context, namely causal relations, and either volumes or timelike 
lengths. Once this is done, the Lagrangian density can be easily rewritten in 
the discrete setting. 

In the causal set context, the gauge field is defined as a: 5"^ — > TSU(n) 
where TSU(n) denotes a tangent bundle to SU(n), thus it consists of n x n 
tensors of trace 0. The type-2 Lagrangian generator for gauge field is given as 
(^,/, g), where 



fin, r2, ra, r^, rg) = (n, ra, r^) , g{{ri, ra, r^, r^, rg) = (rg, r^, rg) . (4.82) 

This means that in the case of fiat Minkowski space, the pre-Lagrangian is 
given by 



J{a, ri, r2, rg) = tr[(a(ri, ra) + a(r2, rg) + a(r3, ri)f] 



(4.81) 



and 




d^Td'sdHiJifip, q, r, s, t)) + J{f{q,p, r, s, t))+ 



+ E{J{g{p, q, r, s, t)) + J{g{q,p, r, s, t)))) 




d'^rd'^sdH{J{p, g, r) + J{p, g, r) + E{J{r, s, t) + J(r, s, t))) 




(4.83) 




It can be easily seen that 



J{p,q,r) = J{q,p,r) 



(4.84) 
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which imphes 

£ = 2 (y^{a{p, q)) I d'^rJip, q,r) + E I d'^rd'^sdHj{r, s, t)) (4.85) 

Assume that the gauge field is differentiable and reasonably well behaved. In 
particular, it is well-behaved-enough for F^^,'' to be approximately constant 
in a{P,Q) whenever {P,Q,E) e CKa,Ti,T^{u) for some u & S. Assume for 
definiteness that the three points are spacelike related. Choose a coordinate 
system so that r coincides with the origin, the x axis points from r to s, and the 
y axis is perpendicular to the x axis in the rst plane. Then in this coordinate 
system a = (0, 0, 0, ...), b = (0, b\ 0, ...), c = (0, c\ c^, ...). 

The flux of F^^^ through the interior of that triangle is expressed by the 
relationship 

a(r, s) + a{s, t) + a{t, r) = i F^'' + ... (4.86) 

This result generalizes to points at arbitrary locations, and can be written 
covariantly as 

a(r, s) + a(s, t) + a(t, r) = \ F^/ T^s^" - r>'){f - r") + ... (4.87) 

RecaUing that, for SU(n), tr (T'^T^) = C2 Ski, to leading order in the separation 
between points, 

tr[(a(r, s) + a(s, t) + a(t, r)f] = (4.88) 

Let's start from 

I d'^rd'^sdHjir, s, t) (4.89) 

JT{p,q) 
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Expand the right-hand side of Eq (I4.88P , and integrate term by term. Clearly 
any term with an odd number of powers of any variable will integrate to 0. 
Thus, the only terms that may potentially survive the integration are those 
of the form r'^ r'^ or quadratic terms in two of the three points. Simple 
counting of terms gives 



where V is the volume of the Alexandrov set a{p, q). 

The only terms of F^^'^ FpJ' j-^^p^" that survive integration are the ones 
whose indices are pairwise equal. But if either fi = u or p = a then F^,^'^ = 
or Fp^'^ = 0, respectively, which would set the whole thing to 0. Thus, the 
only options are fj, = p, u = a and u = p, p = a. The antisymmetry of F^j/'^ 
then implies that these two cases are opposites of each other, which in turn 






(4.90) 



implies that FpJ' FpJ' r^^ r"" = 0. Thus, Eq (14.901) becomes 




3 fcdC2r3'^+4 



d-l 



2 



Y: {j'j' J2{F.or + {j'f ' (4-91) 



k 1=1 i<j 
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where = t ^ Ia{pq) ^'^^ {x'^)'^, or in other words 



-7r/2 

Now let's move to the second integral, J^^^pg^ d'^x tr[(a(p, x) + a(x, Q')+Q.(q',p))^], 
where p -< q are the endpoints of the Alexandrov set. Rewriting Eq fl4.88p in 
terms of the points p, x, and q gives 

tr [{a{p, x) + a{x, q) + f{q,p)f] (4.94) 

= i C2 F^/' b'^ - x^) (g'^ - x") FpJ {pP - x") {q^ - x'') . 

Again this can be expanded and integrated term by term. There are several 
conditions each term has to meet, in order for its integral not to vanish. First 
of all, it needs to contain an even number of factors of x. Secondly, as was 
shown before, for symmetry reasons 

i^M^ Fpa'' I d'^x x^ x'^ x^ x" = . (4.95) 

Ja{p,q) 

Finally, F^^^p^p'^ = F^j^q^q'^ = and identities p = (—|, 0,0,0) and q = 
(|, 0, 0, 0) imply Ff.^pf'q" = -F^^p^'p'' = 0. The only terms in Eq fOSD that 
do not vanish for any of the above reasons are 

F F rf r'' F F n'^ r" r'' n°' 

Fp." FpJ x^ q" pP x-^ , F^/ x'' q^ x" g'^ . 

Plugging in the coordinate values of p and g, each of the above four expressions 
evaluates to \ F„,n^ F„n^ x^ x'' . In order for this not to be an odd function, 
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IJ> — p has to hold, and in order for F^q to be non-zero 7^ has to hold. 
Thus, this becomes (-Pio^)^ (2^*)^ and, since there are four such terms, the 
integral becomes 

/ d'^x tT[{a{p,x) + a{x,q) + a{q,p)y] 

J a(p,q) 

= i^^^'E / d'x{F,o'n^r = ic,r'+'J'J2(^^o'r ,(4.96) 

i=l Jocip,q) 

where, based on rotational symmetry, = ... = J'^~^ was used. 

Substituting this into original expression for pre-Lagrangian and doing 
some basic algebra, the latter becomes 

i<j 

In order to get rid of the variation that results from different choices of Alexan- 
drov sets, we would like the above expression to be relativistically invariant; 
in other words, we would like this to be proportional to F^^^F^i,. This means 

ZEaf + ^ = -?,EaJ' , (4.98) 



which implies 



Substituting this expression for gives a Lagrangian 



>C(a,P, g) = ■ (4-100) 



This can be expressed as 



/:{a,p,q)^-^FrFa,. (4.101) 
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where Vq is a volume taken up by a single point if Fj*^ is defined as 



This means 



Prom this we can determine how the charge changes from dimension to dimen- 
sion. On the first glance, since we haven't yet defined Lagrangian for fermions, 
we are only ready to talk about the charge of bosonic fields. However, a sim- 
ple symmetry consideration allows us to overcome this barrier and include 
fermionic charges in a discussion. Whether a field is bosonic or fermionic, we 
would like to be able to say 

d^^d, + eA^ (4.104) 

We would also like to be able to say 

d^^d^ + eaAd^ (4.105) 

This means that, regardless whether the field in question is bosonic or fermionic, 
and regardless of any other properties of the field (such as mass) it should sat- 
isfy 

eA'' = e^A^ (4.106) 
This immediately implies that, both for bosons and fermions. 



e vojj^ + Ji) 
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4.7 Type-2 Gravity 

Now let us move to the number one reason that type-2 fields were invented: 
gravitational field. The Lagrangian generator for gravity is (J", /, g) where 



8^ if r ^ i ^ s 



otherwise 



(4.108) 



The function / and g are defined as 

/(r, s, t, u) = (r, s, t) , g(r, s, t, u) = (r, t, u) (4.109) 
This means that pre-Lagrangian is given by 

C = J d'^rd'^s^{-lY-^ det g{r)^/{-lY-^ det g{s) x (4.110) 

X {J{f{p, q, r, s)) + EJ{g{p, q, r, s))) 

Since J" is a constant inside a certain domain and is outside, the above 
integrals amount to restricting r and s to a certain domain. The first term is 
J^{p, q, r) thus its restriction is p ^ r -< q. That statement, of course, is trivial 
which means that the first term has no restriction at all. On the other hand, 
the second term is J'{p,r,s), thus it has a restriction p ~< r ~< s ~< q. Thus, 
the pre-Lagrangian is 

^ " 8^ ( / d'^d'^Vi-^r-' det g{rW i-iy-' det g{s)+ 
+ E f d'^rd'^sy/i-iy-^ det g{r)^/{-ly-^ det g{s)] = (4.111) 

J p<r-<s-<q 

^{V{a{p,q))f + E J d's^/{-lr-'detg{s)V{a{p,s)) 
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One can parametrize the interior of that Alexandrov set with normal geodesic 
coordinates around p. Suppose index ii stand for arbitrary coordinates, satis- 
fying 

9M=nnv (4.112) 

while not necessary geodesic. Then one can define the normal geodesic coor- 
dinates derived from the above in the following way: 



r''^V^"d,\p'yprJ [ g^udxi^dx- (4.113) 

y Jlpr 

where ^ab denotes geodesic segment connecting a and h. From this, it is 
straightforward to see that the following equation is satisfied exactly: 

r(p, r) = rtpjpT^r^ (4.114) 

It should be noticed, however, that the above is true only if one of the two 
points is p: 

T(r, s) ^ Tfj^isP - r^)(s^ - r^) (4.115) 

However, a different causal relation, -<p will be introduced in addition to al- 
ready existing one -<. While -< matches flat space expectations only if the pair 
of points in interest includes p, does so for arbitrary pairs of points: 

a-<ph^ ri^{h>' - oF) (6^ - a^) > (4.116) 

At the same time, we will retain an original causal relation, -< for which the 
above is not true: 

^T^^p (4.117) 
107 



Let define the following notation: 

J+(a) = {r ^ a} , J+(a) = {r a} (4.118) 

J-(a) = {r ^ a} , J-(a) = {r a} (4.119) 

a{a, b) = J+(a) n J-{b) , ap{a, b) = J+{a) n Jp{b) (4.120) 

Also, lets define Vp{a,b), which is not to be confused with V{ap{a,b)), as 
follows 

Vp{a,b) = UvAb' - a^ip^ - a^Y'^ (4.121) 
It is important to notice that 

Vol{ap{a, b)) ^ Vp{a, b) (4.122) 

because the definition of Vp neglects •>/ {—lY'^'^g factor in the volume element. 
But, for our purposes, Vp as defined above is the simplest to use. On the other 
hand, V{a, b) is defined as the actual volume of Alexandrov set: 

V{a,b) = Vol{a{a,b)) (4.123) 

Then, the integral of the volume can be expanded as follows 

I (fr^J{-lY-^deigV{aip,r)) = I d\Vp{p,q)+ 

+ Aw(p, q) + A2d(p, q) + A3<i(p, q) + 0(r^"+^) (4.124) 

where Airf(p, q) is an error due to the mismatch between Q;(p, q) and ap{p, q), 
A.2d{p, q) is a correction due to the error in V{a(p, r)) and A3 is an error due 
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to the use of dfir instead of dfir -^f {—lY~^detg for a volume element. They are 
formally defined as follows: 

Aidip,q)= [ d'^rVp{p,r)- [ d''rVp{p,r) (4.125) 

{a{p,q)\ap(p,q)) J oip(p,q)\a(p,q)) 

A2d(p,g)= / d''r{V{p,r)-Vp{p,r)) (4.126) 
A3d(p, q)= [ d''r{y/{-lY-^detg - 1) (4.127) 

Whenever any of these three correction terms are computted, the other two 
are neglected since the "correction of the correction" is of the order 0(t^''+^) 
while the calculation is performed to order 0(t^'^"'"^). Thus, the shape of 
Alexandrov set is assumed to be unchanged in the calculation of and A^^, 
g is dropped in calculation of Ai^^ and and the correction to 
V{a{p,x)) is neglected in calculation of Ai^ and A^d 

Let's start with computting Ai^. 

In normal coordinates, the lightcone of p is not deformed by curvature, 
while hghtcone of q still is: 

J+(p) = j;(p) , J-(q) ^ J-(q) (4.128) 
Substitution of above into simple set theory algebra gives 

ap{p,q) \ a{p,q) = J^ip) H {J-{q) \ J-{q)) C J-{q) \ J-{q) (4.129) 

aip, q) \ ap{p, q) = J^ip) H ( J-(g) \ J-(g)) C J-{q) \ J;{q) (4.130) 

Assuming that J + p~{q) and J~{q) are very close to each other, the above 
implies that most of the contribution to Ai^ comes from the vicinity of Jp{q)- 
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So, lets evaluate Vp{p,r) for r & (q). In other words, assume that 

J2irr-ir-rr (4.131) 

This imphes that 

^^M^F ^ ^^0)2 _ _ ^0)2 ^ ^^2r° - r) (4. 132) 

By using 



2r° - T = r° - (r - r°) = r° - J](r*^)2 (4.133) 

this becomes 

^r^r^ = To (r° - V^(^) (4. 134) 

Let's define two functions xi X2 as follows: 

Xi(r)=/ Unj^r^n^'^ - I hirtj^r^rn''/^ (4.135) 

Ja{r,q) J ap{r,q) 

X2(r)= / fc.r^/^(r°-v^ 

^ afr.o) 



ci/2 



As a consequence of the fact that J~^{p) — Jp{p), 

Xiip) = X2(p) (4.137) 
Furthermore, it is easy to see that 

Xi(?) = X2(g) = (4.138) 

This means that 

rT(.p,q) (]yo( r(T)) 

Ai, = xi(p) = -(X2(g) - X2(p)) = - ^^^^^k^ (4-139) 
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where r(T) is defined as a point on whose distance to r is r: 

7pr(T) C , Z(7pr(r)) = T (4.140) 

This immediately imphes that 

St > ^ r{T) -< r{T + 6t) (4.141) 

Furthermore, in normal coordinates 

r'^(r) = (4.142) 

where is a tangent vector to jpg at p. The fact that the latter is timelike 
implies that 

St>Q^ r(r) r(r + 5t) (4.143) 
In order to compute dx2{j' {t)) / dr , notice that, schematically, 

5x2{r)^ f f- f f- f f+ f f = 

= / /-/ /-/ /+/ / = 

^5l\52 ^52\5l JS3\S4 JS4\S3 

= / /-/ /- (4.144) 

-I f 

J {S3\S4)\{S4\S3) J{S4\S3)\{S3\S4) 



where 



Si = a{r{T + St), q) , S2 ^ Oip{r{T + St), q) , 

Ss = a{r{T), g) , ^4 = «p(r(T), q) (4.145) 
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Substituting both r(T) -< r(T + St) and r(T) -<p r(T + St) into simple set 
theory algebra, one obtains 



(a(r(r), q) \ ap{r{T), q)) \ {a{r{T + St), q) \ ap(r(r + St), q))) = 

= (J+(r(r))\ J+(r(r + 5r)))n(J-(g)\a,(r(r,g))) C J+(r(r)) \ J+(r(r + 5r)) 
{ap{r{T), q) \ a{r{T), q)) \ {ap{r{T + St), q) \ a{r{T + St), q)) = (4.146) 
= iW^)) \ j;i.r{T + St))) n ( J;(g) \ a(r(r, ?))) C j;(r(r)) \ J^^t + St)) 
(a(r{T + St), q) \ ap(r{T + St), q)) \ (a{r(T), q) \ ap{r(T), q)) = 
= a{r{T + St), q) D {J^{r{T)) \ J+{r{T + St))) C J+{r{T)) \ j;{r{T + St)) 

{ap{r{T + St), q) \ a{r{T + St), q)) \ {ap{r{T), q) \ a{r{T), q)) = 
= ap(r(T + St), q) n ( J+(r(T)) \ J+(r(r + St))) C J+(r(T)) \ J+(r(T + St)) 



Thus, all four integrals are performed either over a subset of J~^{r{T))\J~^{r{T+ 
St)) or over a subset of Jp{r{r)) \ Jp{r{r + Sr)) In either case, the range of 
integration is in a vicinity of the lightcone of r(r). 

Now let s be an arbitrary point in that region. Remembering that r 
denotes a distance from p to the point of intersection of that region with 
geodesic 7^^, and is not to be confused with a distance from p to a floating 
point, we have 



. e ( J+(r(r)) \ J+(r(r + St))) U ( j;(r(r)) \ j;(r(r + St))) ^ 




(4.147) 
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Thus, all integrals reduce to the integration over constant: 



fd^'x - / fd^x - / fd'^x + / fd'^x 

Si J S2 ^ S3 J S4 



f- f- f+ f = 

{Si\S2)\{Sa\S4) J {S3\Si)\iSi\S2) J {S3\S4)\iSi\S3) J (S4\S3)\{S3\S4) 

= {TTof' [ d'^x - {rr,f^ [ d^x- 

J {Sl\S2)\{S3\S4) J {S3\S4)\{Sl\S2) 

- i'^^oY^^ [ d'^x + {ttqY'^ [ d'^x = (4.148) 

J {S3\S4)\{S4\S3) J {S4\S3\{S3\S4) 

= ir^ofH [ d'^x- j d^x- j d'^x+ j d'^x) = 

J Si J S2 J S3 J S4 

= {rrof'iViS,) - V{S2) - V{S,) + V{S,)) 
Now, V{Si), ¥{82), V^(5'3) and V{Si) can be read off from Ref [7] as follows: 

V{S,) = {kd+{A,Rg,,+BdR^,){qf'-r^{T+ST)){q''-r%T+5T))y{r{T+ST),q) 

(4.149) 

V{S2) = hT''{r{T + 6r),q) (4.150) 

V{S,) = {k, + {ARg,, + BR,,){q^-r^{T)W -r^{r)))r\r{r),q) (4.151) 

V{S^) = k,T\r{T),q) (4.152) 

Since coordinate system is defined in terms of geodesies comming out of p, all 
of these geodesies, including 7^^ are, by definition, straight lines in the chosen 
coordinate system. Therefore, it can be assumed that coincides with t-axis, 
which simplifies the above equations: 

V{Si) = kdin -r- 6r)'' + [AdR + 5,i?oo)(ri - r - drf^^ (4.153) 
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V{S2)^ka{n-r-5r)'' (4.154) 
^(-5-3) = kdin - r)" + {AaR + BaRoo){ri - rf^^ (4.155) 
V{S^)^kd{n-Tf (4.156) 

This implies that 

X2(r(T + St)) - X2(r(T)) = {nT)'/\V{S,) - ¥{82) - V{Ss) + V{S,)) = 

= {AaR + BM{nT)<"^^{n - t)'''-'St + 0(t'^+^(5t)^) (4.157) 

ar 

which imphes 

^^^^hMiH ^ + s,i?oo)(nr)'^/^^(r, - r)'^+^ (4.158) 

Thus, 

Aid = Xi{p) - Xi{q) = X2{p) - X2{q) = (4.159) 
= (A,i? + B,i?oo) dT{nT)''/'^{n - t)''+' 
The binomial expansion of (ri — r)'^+^ gives 

Ai. = -(A.i? + BM r dT{T,T)'''j- (y,{-1)' ("^ I ^) r^^-V^) 

•^^ ^ fc=0 ^ ^ 

(4.160) 

Evaluating of the derivative and combining it with r'^/^ factor gives 

Ai. = -(^.i? + S,i?oo) drrf" ( X:(-l)'^ ("^ ^ ^) rr^-'=fcri+'=-^) 

(4.161) 

Finally, the integration of that expression gives 
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Now lets compute /S.2d- We will now go back to usual notation, where r (which 
will be denoted by x in order to avoid conflict with r being a radius of a ball)is 
an arbitrary element of a{p, q), and it is no longer assumed to lie on 7^^. As was 
stated earlier, correction is due to the error of computing the volume of 
a{p, x) where x is far away from the boundary of a{p, q). Since that correction 
is already of the order 0(t'^"'"^), its integral over a{p, q) is of the order 0(t^'^"'"^), 
while its integral over the "corrections" to the shape of q) is of the order 
Q^^2(i+4^ For this reason, the latter term will be neglected, and it will be 
assumed that the shape of a{p, q) has not been affected by curvature, while the 
shapes of Q;(p, x) has. Furthermore, in light of the geodesic coordinates, while 
the corrections to volume of a{p, x) are not neglected, the flat space equation 
for the distance will be used in computting them. Thus, the equation for 
becomes 

= AdR I d''x{xPxj,Y+^ + BdRjru [ (4.163) 
By using the cylindrical symmetry, this becomes 

A2d = A,i? I d^xix^Xj^Y+i^ 

Jap{p,q) 

+ BaRml d'^x{x^f{xPxpY'''+ (4.164) 
-rBa{Y,Rkk) [ d'xix'Yix^x^Y/' 

Jap{p,q) 

where, due to cylindrical symmetry, {x^Y was replaced with(a;^)^. 
From cylindrical symmetry, 

f d%(a;%)^+i = (4.165) 

'^ap{p,q) 
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Jap(p,q) J 

This can be used to get rid of integral involving {x^Y to get 



A2 = t2'*+2 



+ ^ {dB,H,,,,,/, - 5.i/,,o,i+i)] (4-166) 



where 



Hd,i,j = I d^x{x^)\x^x^y (4.167) 



and 



H,,,^ = ^J d^xJix^x^y^'^ (4.168) 
here j can be either an integer or half integer. 
Let us now compute these coefficients. 

In the calculations that follows we will treat the above integrands as simply 
functions, and "forget" that their source is a curvature. 

By slicing ap{p, q) into balls t — const and then slicing each ball into 
spheres, it is easy to see that 

f d%(x° - pynx'p - p^) {xjj - pp)y'^d'^x = 

Jap{p,q) 

^' dt(r drr''-''(t' -fy/')+ (4.169) 



r((d-l)/2)Vyo V io 

+ £ dt(t' £ 'dff'-^f-fy/^)) 



By changing variables to 



u^^,s^^ (4.170) 
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'o:p{p,q) 



the above expression becomes 

T{{d -l)/2) iJo ^ Jo ^ ' ) 

+ r dM(^+^+''-^ ^ ds^-\l-i'y/'^y^ (4.171) 

*/ 1/2 «/ 

Since the hmits of integration in the first term are constants, that term can 
be represented as product of two separate integrals. After evaluating the u- 
integral, the expression becomes 



Jl/2 Jo ' 

Prom now on the calculation splits into four cases: even and odd d and even 
and odd j. Prom the original intentions of the calculation it is clear that 
whenever d is odd, j is also add and visa versa. So only these two cases need 
be considered. 

CASE 1: d and j are both even 

Since j is even, denote it as 

J = 2h (4.173) 
Expanding (1 — s^)'* binomially, the integral becomes 

Jap(p,q) 
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r((d-i)/2) 



(4.174) 

/•I / /•=-! 



'1/2 

After evaluating the s integrals this becomes 



Jap(p,q) 



9^(d-l)/2 /l.^ 



i._n \ / 



r((ci-l)/2) ' \k)V2i+'^h+d(^i + 2h + d){d-l + 2k) 



(4.175) 



1/2 V d - 1 + 2A; 



After pulling expanding out d — \ + 2k and expanding out (= — 1)*^ i+2k^ ^^laX. 
becomes 

/" d'^x{x^ -p^i {x'' - pi") {xj, - pp) yi'' = 

r((d-l)/2) ^d-l + 2A;VA;y V2*+2^+'^(z + 2/i + (i) ^ ' 



1=0 ^ ^ ''^/^ 

Evaluating the integral that is left this becomes 

J d%(x° - pJiix'P - pP){xj, - pj,)y/^ = HaihT'+^+'' (4.177) 



where 



_2^^'^ (-1)^ //.\/ 1 

E k^Qd-l + 2k\k)\2^+^h+'^{i + 2h + dy ^ ^ 

i[d-l + 2k\l-{^ 



+ y (-i)r~ ^ ^^^^ ) 

^ ^ M Z i + 2h-2k + l + lJ 



1=0 
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CASE 2: d and j are both odd 

Since j is odd, it will be replaced with 

j^2h + l 

Then binomial expansion tells us 
which means that the original integral can be rewritten as 

Japip,q) 

9^(d-l)/2 /t.\ 

"r((ci-i)/2)" 1) 

Jo 



X 



(i + 2/i + 1 + (i)2^+2'^+i+'^ io 



+ 

'1/2 

Since d is odd, so is d — 2 + 2k. Thus, 

(i-2 + 2/c = 2a + l, 

where 

d-3 + 2k 
" = 2 ■ 

Thus, the integral of interest is J s^^+^Vl — s'^ds 
By using 

s = sin ^ 
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the integral becomes 

J 5^"+ Vl - ds = y"sin2"+^^cos^dsm^ = J sm'^''+^0 cos^O S (4.185) 

Combining one of the sin 9 factors with d9 gives 

J s2"+i Vl-s^ J ^ ^ d cos ^ (4. 186) 

Expanding sin^" 9 as 

sin2«^= (1 -cos'^)" = X^(-l)'(^) cos'''^ (4.187) 

&=o ^ ^ 

the above integral becomes 

/ s2»+iyr^ds = -^(-l)''[^] / cos2^+2^dcos^ = 

=-t(-o'(:) 



b=0 

'a\ cos^^+^ ^ 



6=0 

By substituting s = sin ^ this becomes 



26 + 3 



(4.188) 



J s'''^'V^'ds = -j2^(^^i^-s'r'^ (4.189) 



y (-1)' fa 

6=0 

Substituting 2a + 1 = d — 2 + 2/c we obtain 

k+ 



d-3 



Thus, the original integral becomes 

/ d% (/ - pyHx'' - pf) {xj, - pp))'^+^ = 

J ar,(v,a) 



r((d-i)/2) 
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X 



V (i + 2/i + 1 + d)2'+^h+i+d J^^^ 
By evaluating the — 1-term of the integral over u, and also pulling u out of the 
denominators, the expression becomes 

Jaj,(p,q) 



n \ / h—r\ ^ / 



r((d-i)/2) 

X ( Z '-2^' , 

V(i + 2/i+ l + (i)2*+2'^+i+'^ i + 2h + d+l 
+ J^du xr+"^+''-''-'(2TZ - l)^+t)) 

By expanding (2iZ— 1)''+^ we get 



c=0 

By setting 



' du lf+^''+''-^'-\2u - if+'i = (4.193) 

1/2 



= V2m - 1 (4.194) 

this becomes 



du u'+"'+''-''~%2u - 1)"+^ = (4.195) 

1/2 



Jl/'. 



c=0 

By expanding (v^ + i)»+2h+(i-6-2-c ^j^jg becomes 



d7lir+2'^+'^-2^-3 (2 u- 1)^^+3/2 

il/2 
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^ /7iJ_l\ r i+'2h+d-b-2-c 



c=0 ^ ^ " e=0 

j+2/i+d-fe-2-c 



c=0 ^ ^ e=0 

Substituting these into original integral we obtain 



/ d'x{x^ - pyHx^ - jf){xj, - pj,))'^'^ = H,^^,,^r"^^''-'^' (4.197) 



'ap(p,q) 

where 



_ 2„«'-m Ji,, /,A (_i) + 



X 



fc=0 ^ ^ 6=0 

1 1 _ (l\i+2h+d+l 

^2 J ^ (4_;L98) 



(i + 2/i + c^+ 1)2^+2^+'^+! i + 2h + d + l 

c=0 ^ ^ e=0 ^ ^ 

Finally, let's compute A3. In normal coordinates, 

Qju' — 'fjjrp '^RjLpva^^ (4.199) 



This imphes that 



V(-l)°'"^det^ = 1 - ^Rp^x^x" (4.200) 



Thus, 

Asd = -y^T^ y" d<^xx^x^((x" - p^){x^ - p^))"/^ (4.201) 

As was done with A2, the correction to the correction term will be neglected, 
which means that integration is performed over ap{p, q) instead of a{p, q) and 
no correction term is introduced to the V{a{p, x)) when the expression ((x" — 

P°'){Xa — Pa)Y used. 
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Substituting the above into expression for A^^ gives 

A3d = ^Rj^ [ d'xx^x^iix^ - p^){x^ - p^)Y'^ (4.202) 

By cylindrical symmetry, {x'^Y in the above integral can be replaced with (x^)^ 
which means 



^3d — -pr 

D 



Japip,q) 

-{j2Rkk) [ d'xix'^mx'' - - p^))'/'] (4.203) 



'ap(j>,q) 

Again, by cylindrical symmetry. 



/ d'x{{x''-p"){x^-p^)y+i = [ d'x{x^)\{x"-p''){x^-p^)f'- 

Japip,q) Jap(p,q) 

-{d-1) I d'^x{x^f{{x"-p^){x^-p^)Y/^ (4.204) 

Jap{p,q) 

which allows to express integral involving {x^Y in terms of integrals involving 
{x^y. Substituting this expression into the expression for Aa^ and doing some 
simple algebra gives 

A3. = -^^^^r''^'{md,o^^i-H,^,^.) + Room^^^ (4.205) 

By adding Ai^, A2d and Aa^ the total correction becomes 

Ad = Aid + + Asd = T^'^+^CdR + DdRoo) (4.206) 

where 



--.o,«(^.+^-5(i^)+^(^i^.) (-07) 
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and 

d+2 



Now, as we were computting the corrections to the integral, we almost forgot 
the main term! Here it is: 

[ d''xkd{{x'P - pP) {xj: - pT,) f^ = kdT^''H,^o,l (4-209) 

Jap(p,q) ^ 

This means that the total integral is 

[ d^xViaip, x)) - kdH^.^ ar^" + {CaR + DaRooV^^ (4.210) 

Ja(p,q) ' 

In the beginning of this section it was shown that the gravitational pre- 
Lagrangian is given by 

£(^,S,p,g) = ^(l^2(«(p,g))+ 

+ E I d'^r^J{-\Y-^AQigV{p,r)\ (4.211) 

Ja(p,q) 

By substituting the above expression for the integral as well as 

V{a{p, q)) = + T^'+'iAaR + BaRoo) (4.212) 

we obtain 

E,p, q) = T^^kaika + EHa,o,d/2)+ (4.213) 

+ r''''+^{R{2kdAd + ECd) + Rooi^kaBa + EDa)) 

Thus, in order to minimize variations, we have to get rid of 0(t^'^) contribution 
which comes form Rqq term. Thus, E^ is selected in such a way that would 
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set the coefficient of Rqq to : 



2kaBa + EaDa ^O^E^^ ^ (4.214) 

Substituting this into the expression for the Lagrangian gives 

= ^[^""kl[l-^-^H,,,,,/,)+2k,Rr^^^^ (4.215) 

As long as d is fixed, the first term in the above Lagrangian is constant and, 
therefore, does not affect physics. This means that Lagrangian that we care 
about is proportional to R as expected. 

The gravitational Lagrangian can be rewritten as 

where Vq is a dimension taken up by a single point of a causal set and is 
given by 



Ga = G .,, 0,+,, . ^ (4.217) 



However, if we release the assumption that dimension is fixed, the first 
term can be interpretted as dimension-Lagrangian which, for all the practical 
purposes, is quite separate from the gravity-Lagrangian in the second term, 
even though their origin is the same. That dimension-Lagrangian tries to 
"force" the local neighborhood of each point to have one dimension rather 
than the other, which in general are non-integer. 

While at this point no work has been done to analyze the behavior of 
dimension-Lagrangian, it is apparent that any Lagrangian-based theory as- 
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sumes the possibility of arbitrary small variation of the Lagrangian. This 
means that Lagrangian should be defined for fractal dimensions. 

We have to adress the fact that calculations performed above were as- 
suming integer dimension. While the final form of expressions for HfHj and 
longer defined for fractal dimensions, some of the intermediate 
expressions in terms of integrals are. After all, integrals of non-integer value 
can always be computted numerically, if not analytically. 

One obstacle, however, is that geometric constructions that were used to 
obtain the integrals were based on integer dimensionality, and it might be a 
good project for a future research to see if they can be generalized to fractal 
dimension. One question that needs to be asked in the future research is how 
relevent the above expressions for fractal dimensionalities. 

A related issue is whether fractal dimensions are defined in such a way that 
every single causal set has some dimension, or whether there is a "qualifying 
criteria" for causal set to be a fractal on the first place, which means that most 
causal sets don't have any dimensionality what so ever. 

In the former case, less assumptions can be made about any particular 
fractal since its criteria is more loose and therefore a fractal-based "copy" of 
earher derivations is more hkely to fail. On the other hand, in the latter case, it 
is likely that Lagrangian will force a non-fractal causal structure, thus viewing 
Lagrangian as a function of dimensionality is physically meaningless, although 
it might still have a mathematical interest. 
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Since topology of causal set is beyond the scope of the thesis, these issues 
will be postponed for future research. For our purposes right now we can 
settle with one version of an answer. According to this version, we can define 
fractal dimension in such a way that every single causal set is a fractal of some 
dimension. 

Furthermore, since Lagrangian density is formally well defined for every 
single causal set, we can simply define curvature R pointwise in such a way 
that the relation between Lagrangian density, fractal dimensionality and R 
perfectly satisfied point by point. In this case, proving this for non- integer 
dimension becomes non-issue since it becomes a statement that is true by 
definition. 

A more practical problem is that if non-integer dimensions are allowed on 
the first place, then the chances are that the "preferred" dimensionahty of the 
universe will also be non-integer. This, of course, does not help our cause in 
trying to explain the actual observed four-dimensional topology. 

As will be seen in the next chapter, the latter is done by introducing 
vierbeins, which are viewed as literal physical fields coupled to causal relations 
via a type- 1 Lagrangian. That Lagrangian "encourages" a correlation between 
the distances inferred by vierbeins and the ones inferred by causal structure 
on the local regions of a causal set. 

This increases the probability of manifoldlike causal sets over non-manifoldlike 
ones. However, this should be weighted against the fact that from the pure 
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combinatoric perspective manifoldlike causal sets are extremely unlikely. Thus, 
it is up to further research to see the overall probability of a causal set be- 
ing manifold-like, based on vierbein Lagrangian taken together with combina- 
torics. 

However, assuming that vierbein theory is successful, the fact that vierbein 
fields are type 1 implies that these can play a role of "constraints" on maxima 
and minima. This can not be said about type-2 fields for which the only thing 
that matters is the average. This might ultimately be a reason why the type-1 
vierbeins "outcompete" type-2 Lagrangian which results in our universe being 
four dimensional as opposed to a fractal. 

Nevertheless, it is possbible that type-2 dimension-Lagrangian does have 
some very small effect, and makes our dimension to be 4 — e instead of 4, thus 
providing a literal meaning to the dimensional regularization. Of course, it is 
up to further research to see if this will, indeed, be 4 — e as opposed to 4 -|- e. 

Furthermore, one should also adress the fact that e in dimensional regu- 
larization theory is assumed to be constant. One way of doing this is to show 
that this, indeed, is the prediction of the theory, which is possible in light of 
the fact that both vierbein Lagrangian and dimension-Lagrangian are constant 
throughout causal set. 

This, however, might not work due to vierbeins playing a role in fermionic 
Lagrangian or dimension-Lagrangian having a common source with gravity 
one, then one has to show that the generalized version of dimensional regular- 
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ization theory based on varying e still works. 



5 Chapter 5: Fermions on a Causal Set 

5.1 Grassmann Numbers as Vector Space Elements 

In order to apply the principle of Lagrangian generators to fermions, one has 
to compare the magnitudes of Grassmann numbers. This, of course, can not 
be defined within a framework of standard theory. In order to go around this 
obstacle, Grassmann numbers have to be defined as individual elements of a 
vector space, outside of integration. 

In this dissertation, it is proposed to view them as elements of vector 
space, S, equipped both with commutting dot product ( • ) , anticommuting 
wedge product (A), and measure ^. Consequently, integral is well defined for 
all functions F , not neceserely linear ones. This statement is independent of 

— * 

the possibility of expressing F in algebraic form. 

Generically, integration is defined for any function F: S ^ S ®{S /\S)® 
{S A S A S) (B ■ ■ ■ where S A S consists of elements of the form a Ab where 
a e S and beS^SASAS consists of elements of the form a Ab Ac where a 
, b, and c are elements of S, etc. 

Noticing the difference between dot and wedge products, generic form of 
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integral is 

J {d^xi A d^X2 ... A d^Xn) ■ F{xi, ...,Xn), (5.1) 

Here d^Xk = ^{xk)xkdxk, i{xk) is a measure with both positive and negative 
values, Xk is a unit vector in the Xk direction; and Xk — x^x^. 

Of course, in order for the above integration to be considered Grassmann, 
certain conditions need to be met: If d^x = ^[x) xdx and x = xx, where x is 
a unit vector in the x direction, then 

j d^x ' ^= j (d^a; A d^y) ■ {x ^y) = 1 (5.2) 
J d^x = J d^xAl = j [d^x A d^y) • f = (5.3) 
j [d^x A d^y) ■ fix, y)^Jd^x-(^J d^y ■ f{x, y)^ . (5.4) 

Each of the first two of the above equations is what is expected of Grass- 
mann variables. On the other hand, the last equation doesn't make sense in 
terms of standard Grassmann theory, since the expression J d9i d92 Oi 62 ~ 
f d9i (/ d^2 Oi 62) assumes that / d^2 ^1 ^2 is well defined, and the latter is 
Grassmannian. The goal of the proposed approach, however, is that all three 
expressions are equally-well-defined, among many others. 

By first trying to evaluate each of the above integrals, one can obtain 
restrictions on dot and wedge products that assure that above equalities hold: 
The equation 

= Jd^x — Jdx^{x)x — x J ^{x)dx , (5.5) 
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requires that 

J i{x) dx = 0; (5.6) 
in other words, the measure has both positive and negative values. 
Furthermore, the expression 

1 = / d> . f = /(dx . (i.) = X . i / . «x) d. , (5.7) 

can be satisfied by setting 

x-x — 1, Jx^{x)dx — 1. (5-8) 

Now consider the multiple-integral example: 

1 = y" d^x- (^J d^y-{xAy)^ = j dx$,{x) {^j dyCiv) y-{xyxA y)^ 
^x-{y-{xAy))(^J x^{x)dx^(^j |/e(y)dy^. (5.9) 

Since it was already estabhshed that 

j xi{x)dx = j yi{y)dy^l, (5.10) 

the above calculation implies that 

x-{y-{xAy)) = l, (5.11) 

which can be accomplished by setting 

y-{xAy)^x. (5.12) 

A similar argument shows that 

{y Az) ■ {x Ay Az) ^ X (5.13) 
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and 

z-{xAyAz)=yAz. (5-14) 

However, this relationship makes it a httle more tricky to define the dot prod- 
uct consistently, due to the anticommutativity of A: 

y ■ {y Ax) = -y ■ {x Ay) = -X . (5.15) 

This can be addressed by associating unit vectors with elements of totally 
ordered set, thus making a default decision between x Ay versus y Ax. Then 
the power of —1 can be used to extend the definition of wedge product to the 
reverse orders. More precisely, vectors are associated with functions on the 
totally ordered set S — {si, S2, ■ ■ ■ , 

For simplicity, define the ordering in such a way that Sj < Sj if and only 
if i < j. 

Si A S2 is a function defined as follows: 

(siAs2)({si,S2}) = l {s,As2){T)^0,T^{s,,S2} (5.16) 
Anticommutativity implies that 

(S2 Asi)({si,S2}) = -1 . (5.17) 
Remembering that {si, S2} — {s2, Si}, 

(si AS2)({s2,Si}) = 1 

(s2Asi)({s2,Si}) = -l. (5.18) 
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Furthermore, Si A S2 A S3 is defined as 

(Si AS2 AS3)({S1,S2,S3}) = 1 , (si As2)(r) =0,r^{si,S2,S3} . (5.19) 

The rule of addition of function, together with anticommutativity of the wedge, 
imphes that Asq + Ss^ A sy + Sss A sn A sg is defined as a function / satisfying 
the following properties: 

/({6}) = 4; /({5,7})=3; /({8, 9, 11}) = -8 

/({5,6,7}) = /({5,8,9,ll})= (5.20) 

= /({5, 6, 8, 9, 11}) = /({5, 6, 7, 8, 9, 11}) = 

More formally, dot and wedge products are defined as follows: 

Definition: Let pi and p2 be two polynomials over S. Then pi ■ p2 is another 
polynomial over S such that for every T G S, 

(Pi-P2)(T)= J2 PiiU)P2{V). (5.21) 

{U\V)U{V\U)=T 

The wedge product is defined as 

(pi A p2)(T) = Yl (-l)«("''')l">'' ■ ■ '^''^Pi{U)p2(V) . (5.22) 
uuv=T ■ unv=9 

Finally, the definition of the derivative requires a definition of ratio. The situa- 
tion is analogous with the set of integers where ratio is not defined everywhere 
and yet the notion of the ratio is used where it is. This is expressed in the 
following definition: 
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Definition: Let a and b be two Grassmann polynomials. If there exists a 
Grassmann polynomial c such that a A c = b then it is said that c = b/a. If 
such c doesn't exist, then b/a is not well defined. 

The important thing is that the fraction was defined in terms of the wedge 
product, as opposed to the dot product, and also that the wedge product was 
ordered in the way it was. 

The other important component of definition of derivative is the definition 
of a limit, which requires the notion of absolute values. Clearly, since v A 
V = 0, the wedge product can not be used as a definition of absolute value. 
Furthermore, since 

{l + v)-{l + v)^l + 2v + v-v^2 + 2v (5.23) 

is non-real, the dot product can not be used as a definition of absolute value 
either. 

Instead, the absolute value oiVcS®{SAS)®{SASAS)®... will be 
defined as a largest possible coefficient in its polynomial expression. As was 

— * 

done earlier, the coefficient of the A ... A term of V is identified with 
V{{ai, ...,an}) up to possible sign difference. Since the sign difference does 
not affect absolute value, this gives the following definition: 

\V\^sup{\V(T)\\T C S} (5.24) 

THEOREM: The above definition of absolute value satisfies triangle in- 
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equality 

PROOF: Let U and V be two elements of S ® {S A S) ® {S A S A S) ® . . .. 
Suppose 1^7 + V\ = c. Then for any e > there exist T G S such that 
\U{T) + V{T)\ > c-e. The triangle inequality on R implies that |c7(r)| + 
\V{T)\ >c-e. But, by definition, \U{T)\ < \U\ and \V{T)\ < \V\ Therefore, 
\U\ + \V\ > c — e Since this is true for all e > 0, this implies that \U\ + \ V\ > c, 
as desired. QED. 

However, it is NOT true that absolute value of a dot product is equal to 
a product of absolute values. This can be seen from the following example: 

1(1 + ei) • (1 - ei)| = |1 - ei • ei| = |1 - 1| = (5.25) 

yet, 

|l + ei||l -eil = 1 X 1 = 1 (5.26) 

This, however, does not compromise the possibility of defining limits since, for 
a general metric space, there is no definition of product altogether, yet limits 
are well defined. 

Based on the above definition of absolute value, the limits on Grassmann 
space are defined as follows: 

Let F be a function of the form S ^ S ® {S A S) ® {S A S A S) ® . . ., and 
suppose a e S Siud A e S ® (S AS) ® {S A S A S) ® ... Then 

\iuiF{v)^A^ (5.27) 

^ Ve > 035 > 0{W e S{{v-a)-{v-a) < 5' ^ VT e S\F{v){T)-A{T)\ < e)) 
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By substituting the definitions for limit, the definition of the derivative be- 



comes 



4^ 



Ve > 035 > o(yves(^{v- a) ■ {v - a) < 5'^ = 

){T)-A{T)\<e)) 



(5.28) 



VT e 5 



F{v) - F{d) 



V — a 



In order to make things more clean, the ratio can be removed in favor of its 
original definition. Thus, 



F(^4^)(r)-l(r) 

\ V — a / 



< e 



can be replaced with 



3d[{D A{v-a) = F{v) - F{a)) A \D{T) - A{T)\ < e) 



Thus, the definition of derivative reads as follows: 



(5.29) 



<^ Ve > 035 > o(yv e S(^(v- a)-(v-a)<5^^ (5.30) 
VT e S3d(^{D A{v-a)^ F{v) - F{a)) A \D{T) - A{T)\ < 



In light of the fact that the absolute value of a product is NOT equal to 
the product of absolute values, this can NOT be further rewritten as either 
\F{v) - F{a) - A{T) A (w - a) < e or \F{v) - F{a) - A{T) -{v-a) <e, which 
means that the above definition of derivative is the final one. 
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5.2 Vierbeins as Independent Fields 

As is the case for curved space, in the case of a general causal set a necessary 
ingredient to introduce spinor fields is the notion of vierbeins. In this thesis it 
is proposed to view vierbeins as physical fields subject to Lagrangian generator 
distinct from either gravitational or fermionic one. That Lagrangian generator 
would "classically" force vierbeins to be orthonormal to each other. This, 
together with their coupling to causal structure, will increase the likelihood of 
the causal structure being manifold-like. 

These vierbeins, however, do not match what one is used to be thinking of 
as vierbeins. In order to be able to claim that Lagrangian generator "forces" 
vierbeins to be orthonormal, it is important to also claim that such relations 
do not exist apriori. Thus, in case of four dimensional manifold, each vierbein 
has 4 (not 3) degrees of freedom and the total number of degrees of freedom 
associated with 4 vierbeins is 16 rather than 10. These will be viewed as four 
independent vector fields. These vector fields are viewed as distinct both from 
fermions and from gravity, despite being coupled to both. 

In case of general causal set, as before, vector fields can not be viewed as 
having a certain fixed number of degrees of freedom. Thus, by copying what 
was done for other vector fields, vierbeins are defined as simply a real valued 
functions on the set of pairs of points. In order to remind ourselves that these 
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are independent, we use separate letters, a, b, c and d to define them: 

a{p,q)= [ e^dx^ (5.31) 
J 'y(v,Q) 



b{p, q)= da;^ (5.32) 



c{p,q) = / e^dx^ (5.33) 

d{p, q)= da;'' (5.34) 

Intuitively, they represent t, x, y, and z coordinates of q with respect to p, 
respectively. However, it should be reahzed that the above are fields as opposed 
to coordinates, and neither coordinate system, nor manifold structure, were 
introduced by hand. Since no orthogonality is postulated, it is possible to 
take clearly one dimensional causal set, such as R, and postulate four vierbein 
fields on it; it is also possible to start out with M"^ and only postulate two 
vierbein vields. And, finally, it is also possible to postulate different systems 
of vierbeins with their own internal couplings on one and the same causal set, 
each system having different number of vierbeins, thus implying that the same 
causal set has different dimensions from itself! 

The only reason the actual causal set ends up being manifold-like is that 
the system of vierbeins is subject to Lagrangian that makes them orthonormal. 
But even then, it is possible to imagine two competting systems of vierbeins, 
one has three vectors and the other has four vectors, both subject to the La- 
grangians that make them orthonormal. If such two systems act on a random 
causal set, the result would neither approximate three dimensional manifold 
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nor four dimensional one; but, while Lagrangian densities will be a lot more 
complicated, there will be no mathematical contradiction, contrary to what 
one would expect if one did, indeed, postulate two competting dimensionali- 
ties. 

However, the Lagrangian that "encourages" vierbeins to be orthonormal 
should be "stronger" than usual. After all, in case of four dimensional mani- 
fold, despite the fact that there are 16 vierbein degrees of freedom, it should 
closely approximate the situation with only 10 degrees of freedom. While clas- 
sically this is easily done through the principle of least action, the same is not 
true quantum mechanically. 

In order to get a feel of what to do in quantum case, consider a simple 
example of one-dimensional non-relativistic quantum mechanics. The space 
degree of freedom is viewed as an analogue of one of the field degrees of freedom 
of quantum field theory. The goal is to "effectively" get rid of the one space 
degree of freedom by setting x ~ a while staying loyal to principles of quantum 
mechanics.This can be done by setting a potential 



If n is very large, the above would approximate a situation of a particle in the 
box, in which case the particle stays inside the box with 100 percent certainty. 
The purpose of |x — a|" term is to keep the particle from flying to the right, 
and the purpose of \x — a\~'^ term is to keep the praticle from flying to the 




(5.35) 



left. 
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Since the causal set theory is Lagrangian-based rather than Hamiltonian- 

based, to make analogy closer, the above can be rewritten as 



Thus, it is possible to get rid of quantum mechanical degrees of freedom by 
introducing two terms, similar to the above, into the Lagrangian. 

By viewing quantum field as multi-dimensional quantum mechanical sys- 
tem, this approach can be applied to vierbeins as well. The causal set version 
of this, however, is that instead of postulating such terms in a Lagrangian 
itself, they are postulated in a Lagrangian generator (and a choice should be 
made as to whether it is type 1 or type 2): 



where n is a very large number. 

If the above Lagrangian generator is type 1, then in order for it to be 
close to 1 at any given point p, one should be able to find an Alexandrov set 
a(p, q) such that the above expression is close to 1 for every single pair of 
points selected inside of this Alexandrov set. On the other hand, if it is type 
2, any given pair of points can violate that relation, as long as on average it 
is 1. For this reason, I personally think that type 1 is preferable for vierbeins. 
But it is up to future numerical work to see if type 2 would still reproduce the 
predictions of quantum field theory. If so, then type 2 might be better in the 




(5.36) 
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sense that vierbeins would seem less "forced" . 

Of course, either of the two Lagrangian generators select "minimizing" 
Alexandrov set and only affect what happens in its interio. This, however, 
matches what is expected of vierbeins. Due to lightcone singularity, in any 
given coordinate system, most of the points of the e-neighborhood of a point x 
are arbitrary close to lightcone and, at the same time, arbitrary far away from x 
coordinate- wise. Thus, the infinitesimal behavior of vierbeins is not something 
that happens in every single Alexandrov set, but only at a specifically selected 
one. This, does not affect how closely manifold-like it might get inside that, 
relevent, Alexandrov set, as illustrated by the fact that the approximation 
would hold for every single pair of elements of the latter, without fail, in 
type-1 scenario. 

Strictly speaking, this does not imply that causal set is manifoldlike. After 
all, for any given causal relations it is possible to adjust a, 6, c and d point-wise 
in such a way that the causal structure given by — 6^ — — cP matches 
the actual one. However, the more manifoldlike a given structure is, the more 
there are such ways to do that. This means that there will be more identi- 
cal copies of the same causal structure in a path integrals, since all possible 
choices of vierbeins corresponding to each copy has to be counted separately. 
This amounts to manifoldlike causal structures to have higher weight in path 
integral than non-manifoldlike ones. 

However, the fact that manifoldlike structures are overcounted should be 
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balanced against the fact that they are very rare. It is up to future numerical 
work to see whether or not the total contribution of all manifold-like structures 
is greater or smaller than the total contribution of non-manifoldlike ones. This, 
of course, is also related to exactly how the notion of manifoldlike is defined. 
For instance, while a very thin cylinder can be viewed as a manfifold, the causal 
set produced by sprinkling of points onto that cylinder is not manifold-like. 
Exploring these issues, however, is beyond the scope of this thesis. 

5.3 Vierbeins and Symmetry Properties of Fermionic 
Fields 

The next step of the theory is to define symmetry relations between vierbeins 
and fermionic fields. In Ref ^ (which, for the purposes of general interest, 
will be summarized in the appendix to this chapter) it was shown that, in four 
dimensional manifold, in a toy model where spinor fields commute, one can 
"trade" 2*4 = 8 fermionic degrees of freedom with the 8 out of 10 vierbein 
ones. Thus, at any given point one can select a coordinate system in such a 
way that a fermionic field is a superposition of spin-up particle and spin-up 
antiparticle, with real coefficients. These two coefficients can be viewed as two 
real scalar fields, while the choice of frame can be viewed as four orthonormal 
vector fields. A spinor field can be viewed as the combination of these scalar 
and vector fields, which would contain 2-1-6 = 8 degrees of freedom. According 
to the above model, there is no such thing as spin 1/2 fermion. Rather, there 
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are two spin fields and four spin 1 fields coupled to each other. 

However, this approach does not work in light of the fact that vector fields 
are commutting while fermionic fields are anticommutting, which means that 
they can not be identified with each other. True, in the previous chapter 
Grassmann numbers were defined as individual elements of real vector space. 
However the price for doing this was the inclusion of ^-measure. In case of 
several Grassmann numbers, ^-measure is defined as 

m-On)=m)-^iOn) (5.38) 

This is not invariant under rotation of 6-s. One can experiment by defining 
multi-dimensional C, function that can not be expressed as above product, but 
still satisfies 

j dei...d9n^{0i, en)ei...en = i (5.39) 

with all other integrals being 0. But it is easy to see that in order to satisfy 

J dei...denm,-0n) = o (5.40) 

^ has to have both positive and negative values, which means that one has to 
"choose" exactly where in the vector space is a "transition point" where ^ = 0. 
This choice will again violate the rotational symmetry of that space. 

For thses reasons, the idea of [S] has to be dismissed for the case of anti- 
commutting fermions. Instead, all the vierbein degrees of freedom has to be 
viewed as completely separate from fermionic ones. There is no such thing 
as symmatry transformation between fermionic field and vierbeins. The field 
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configurations that we would normally say are related by symmetries, are now 
viewed as completely different from each other, and the fact that their La- 
grangian densities are the same is viewed as completely coincidental. 

This, of course, means that there is no formal justification in attempting 
to introduce Fadeev-Popov ghosts or use other techniques designed specifically 
for symmetries. Instead, there will be a lot of overcounting resulting from the 
fact that Lagrangian densities "happened" to be the same at too many points. 
Infinities will be avoided by restricting path integration to large but finite 
range. 

Quite independently from the above, there is a separate source of extra 
degrees of freedom: as explained in section 5.2 in a lot more details, vierbeins 
are viewed as four independent vector fields that, apriori, are neither orthogo- 
nal to each other nor have norm 1. Thus, in case of four dimensional manifold, 
they contribute 16, rather than 10, degrees of freedom. At the same time, La- 
grangian generator is introduced that "enoourages" them to be orthonormal 
and that generator is "strong" enough to make it "effectively" appear as if 
there are only 10 degrees of freedom. 

Strictly speaking this means that there is no such thing as spin 1/2. In- 
stead, there are four independent spin 1 fields and two independent spin 
fields, the former commute and the latter anticommute. They are coupled to 
each other through various Lagrangian generators, which results in an appear- 
ance of spin 1/2 field which is not really there. 
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5.4 Lagrangian Generator for Type-1 Fermions 

As a consequence of the fact that Grassmann numbers are well defined outside 
of integral, the theory about going from Lagrangian generator to Lagrangian 
is well defined for Grassmann numbers. The next step is to find out what 
fermionic Lagrangian generators actually are. The goal is to come up with a 
Lagrangian generator that would, in the continuum limit, generate 

L = i^^^e^^d^i, + ^7'"<^"V(e^^,e^(a^e,, - d^e^^) + e^e^a.e^) . (5.41) 

This can be rewritten as 

L = iT^ie^a^^^^ + V^7"^<7"V(e^;,e^(a^e,, - d^e,^) + e^e^^.e^) . (5.42) 

Vierbeins are viewed as four separate vector fields, whose orthonormality is 
a consequence of Lagrangian generators introduced in sections 5.4 and 5.5. 
Similarly to what was done for a gauge field, vector fields are associated with 
scalar fields of pairs of points: 

e„(r, s) = (s^ - r^) (e„^(r) + + duCa^.) + 0{t^) . (5.43) 

The expressions that need to be redefined for causal set scenario are A^B^dyC^ 
as well as A'' B^. 

Noticing that 

Ai'B^ = (A + BY{A + B)^ - A'^A^ - Bi'B^ (5.44) 
the Lagrangian generator corresponding to this expression is 

^1 = {^11, ^12, -^13, -^14, -^15, -^le} (5.45) 
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where (/Cii,/Ci2) correspond to {A + B)^^{A + B)^^ (/Ci3,/Ci4) correspond to 

A'^A^ and (/Ci5,/Ci6) correspond to B^^B^. 

For any fixed point p, a(p, r) and r) can be thouglit of as scalar func- 
tions of r alone. Therefore, from section 3.3, they are given as 

/Cii(a, q) = k{a{p, q) + , P ^ g. 

/Ci2(a, = —k{a{p, q) + b(p, q)Y , p and 5 are unrelated. 

/Ci3(a, g) = ka?{p, q),P^q- 

/Ci4(a, Q') = —ka^ip, q) , p and q are unrelated. 

/Ci5(a, b,p, q) = A;62(p^ ^) , p ^ q. 

/Ci6(a, = —kb^{p, q) , p and q are unrelated. 

As was seen in the scalar field section, the same equation applies both to 
spacelike and timelike gradients of scalar field. Thus, the above question will 
be the same if A^ is replaced with one of the spacelike vectors. Therefore, 

/Cii(6, c,p, q) = k{b{p, q) + c(p, q)f ,p^q 

K-uib, c,p, q) = —k{b{p, q) + c{p, q)Y , p and q are unrelated 

/Ci3(6, c,p, q) = kb^{p, q),p -<q 

/Ci4(6, c,p, q) — —kb'^{p, q) , p and q are unrelated 

/Ci5(6, c,p, q) = kc^{p, q) ,p^q 

/Ci6(6, c,p, q) — —kc^{p, q) , p and q are unrelated 
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A slight modification of tlie above defines the value of A'* 9^0. Namely, 
b{r,s) should be replaced with — 0(r), which means replacing i?^ with 
9^0. These substitutions produce 

/Cii(a, q) = c(a(p, q) + (f)(q) - (f){p)f ,p^q 

/Ci2(a, (f),p, q) — —c{a{p, q) + (t){q) — (t>{p)Y , p and q are unrelated 

/Ci3(a, 0,p, q) = ca'^{p, q) , p ^ q 

/Ci4(a, 0,p, q) — — ca^(p, g) , p and 5 are unrelated 

/Ci5(a, g) = c(0(g) - ^(p))^ , p ^ 5 

/Ci6(a, 0,p, q) — —c{(f){q) — (pip))^ , p and q are unrelated 

Again, the same equation applies if is replaced with something space- 

hke: 

^11 0,P, q) = c{b{p, q) + 0(g) - (t){p)f , p -< q 

/Ci2(6, 4>,p, q) = —c{b{p, q) + 0(g) — 0(p))^ , p and g are unrelated 

/Ci3(6, 0,p, g) = c62(p, q) ,p~<q 

/Ci4(6, 0,p, g) = —cb^ip, q) , p and g are unrelated 

/Ci5(6, 0,p, g) = c(0(g) - 0(p))2 , p ^ g 

ICiQ{b, 4>,p, g) = —c{4>{q) — 4>{p)y , p and g are unrelated 

Now let us try to get A^" d^C^. 

Inspired by the similarity in structure with electromagnetism, a good start- 
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ing point is to attempt to maximize the fluctuation of a{r, s) b{r, s) c(r, s) d{r, s) . 
Luckily, it is possible to simultaneously minimize the fluctuation of each of the 
four individual multiples, which would automatically minimize the fluctuation 
of the product. Based on the results of the scalar part, if the gradient of (p 
is timelike, in order for the fluctuations of to be minimized its gradient has 
to be parallel to the axis of the Alexandrov set; on the other hand, if the 
gradient is spacelike then in order for the fluctuations of to be minimized, 
that gradient should he along the equator of the Alexandrov set. This means 
that in order for fluctuations of a(r, s) to be minimized, A^^ should point along 
the axis of the Alexandrov set, while in order for fluctuations of b{r, s), c(r, s) 
and d(r, s) to be minimized, then S'*, and D^^ should he on the equator 
of the Alexandrov set. But the orthogonality condition implies that, not only 
these four statements are compatible, but in fact if the condition about the 
timelike vector is met, it forces the conditions about the three spacelike 
vectors B^, and to be met as well! This means that minimizing the 
fluctuations of a product a(r, s) b{r, s) c{r, s) d{r, s) implies minimizing each of 
the four multiples individually, which completely specifies the paramenters of 
the Alexandrov set. 

Now select points ri, r2, and r4 in a way that minimizes the fluctuations 
of a(ri, 6(r2, r4). Again, each of these two multiplets is maximized sepa- 
rately. It is easy to see that a(ri, r^) is maximized under ri = p = (— | r, 0, 0, 0) 
and — q — {^T, 0, 0, 0) while 6(r2, Vi) is maximized under r2 = (0, | r, 0, 0) 
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and r4 = (0,—|t, 0,0,0) These two conditions are comparable with each 
other. Now suppose that we instead decided to maximize the fluctuations 
of &(ri, ra) c(r2, r4). Again, fluctuations of each of these flelds can be max- 
imized simultaneousle. This time, since both of these flelds have spacelike 
gradient, each one will be maximized by selecting points on the equator. 
In particular, to maximize 6(ri,r3), set ri = (0, — | t(p, g), 0, 0) and — 
(0, 1 r(p, q), 0, 0) and in order to maximize c(r2, r^) set r2 = (0, 0, | r(p, q), 0) 
and r4 = (0, 0, — | t(p, g'), 0). Again, this results in the same square loop 
that was used in the gauge case. Noticing that the square loop for the case 
of one timelike and one spacelike fleld is the same as the square loop for two 
spacelike flelds, except that t axis was replaced by the y axis, it can be summa- 
rized that u{ri,r3) v{r2,r4) is maximized under U^r\^ — — |t, U^r^n = \ 
V^r2ij, — \ t, V^r^^j, — —\t regardless of whether the flelds are spacelike or 
timelike. 

Now if w is a third holonomy, then, remembering that all vectors are unit 
vectors, the results from the gauge part can be rewritten as 

w(ri, r2) + w{r2, ra) + w(r3, n) + w(r4, n) = {dpW^ - d^Wp) . (5.46) 

where p and a are directions corresponding to U and V. Remembering that 
U and V are of unit length, this can be rewritten as 

w{ri,r2)+w{r2,r3) + w{r3,n)+w{u,ri) ^T^U^V {d^W,-d,W^). (5.47) 
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Now C/^ Vl^^ = implies that 

V d^Wi, = -W^ V'd^U^, . (5.48) 
Thus the above expression becomes 

w(ri, + ■u;(r2, ra) + w{r^, r^) + w(r4, ri) = 

= t2(p, g) a^w^, + w v d,u^). (5.49) 

As was stated in the previous paragraph, the above was done under the condi- 
tion of maximization of u{ri,r^)v{r-i,r^). In fact, this is the only dependence 
on u and v which means that if it was taken away, then and would no 
longer make sense on the right-hand side of the equation. 

To make this formally consistent with the generic prescription of La- 
grangian generators, it is necessary to write that maximization explicit, within 
the framework of the former. The problem with this is that the selection of 
points is chosen to maximize one expression, ^(ri, r3)v(ri, rs)), while the ex- 
pression being evaluated is different, w{ri,r2)-\-w{r2-ir^)-\-w{r^,r^-\-w{r4^,ri). 
In order to be consistent with the Lagrangian prescription, these two ex- 
pressions should be the same. This is accomphshed this by replacing both 
of them with {u{ri, rs)v{ri, rs))^ (w{ri, -l- w{r2, rs) + w(r3, r^) + w(r4, ri)) 
where n is a very large number. If n is large enough, the maximizing of 
above expression would approximately mean maximizing u{ri,r3)v{ri,r3). At 
the same time, if the above maximization occurs, then the approximation 
'^('"1)^3) ~ v{ri,r3) ~ T will hold. Thus, the only non-trivial expression left 
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will be w{ri, + 'w{r2, rs) + w{r3, r4) + w{r4, ri), as desired. 
Thus, the equation can be rewritten as 

min max (u{ri, r3)v{ri, r3))"(w(ri, + w{r2, r^) + w{r3, n) + w{r4, ri)) 

t(p,9)=to ri,r2,r3,r4Ga(p,g) 

= t2(i-") (p, q) {U^ d^W^ + W d^U^) . (5.50) 
We can now permute this equation to get another two equations, 

(w(ri, r3)w(ri, r^)) '\u{ri, + u{r2, r^) + ^(ra, + M(r4, n)) 



mm max 

''"(Pi9)=''"0 ri,r-2,r-3,r-4ea(p,g) 



t'^'-^Hp, 9) (^'^ d^Uu + C/'^ W^'^ , (5.51) 



and 



min max (w(ri, r3)u{ri, r3))"(t'(ri, r2) + v{r2, r^) + ^(ra, r4) + v{ri, ri)) 

T(p,g)=To ri,r2,r3,r4ea(p,g) 

= g) (^^/^ [/^ d^V^ + l/'^ t/'^ 9^H//,). (5.52) 



Subtracting the second equation from the sum of first and third, and then 
dividing the whole thing by 2, leads to 

~ i f^) { ^^-^ ('"(''i, '''3)'«^(''i, '''3))" X (5.53) 

\Vq/ \ T{p,q)=To ri,r2,r3,r4,ea{p,q) 

x{w{ri,r2) + w{r2,r3) + w{r3,r4) + w{r4,ri)) + 

+ min max {v{ri, r3)w{ri, r3))"(ii(ri, r2) + u{r2, ra) + u{r3, r^) + u{r4, ri)) - 

T{p,q)=To ri,r2,r3,r4&a{p,q) 

- min max (^(ri, ra)w(ri, ra))"(t'(ri, r2) + v(r2, ra) + ^(ra, r4) + t'(r4, ri)) ) , 

Tip,q)=To ri,r2,r3,r4€aip,q) / 

which is the desired term. 
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The Lagrangian generator for the above term is {/Ci, /C2, — /C3}, where 

]Ci{u, V, w; ri, ra, rg, = (5.54) 

= {u{ri,r3)v{ri,r3)y{w{ri,r2) + w(r2,r3) + w(r3,r4) + w(r4,ri)) 

/C2(m, V, w; ri, r2, rs, = (5.55) 
= {v{n,r3)w{ri,r3)y{u{ri,r2) +u{r2,r3) +u{r3,r4) +u{r4,ri)) 

K,2.{u, V, w; ri, r2, rs, r4) = (5.56) 
= (w(ri,r3)w(ri,r3))"(u(ri,r2) + v(r2,r3) +i^(r3,r4) + w(r4,ri)) 
We can now rewrite the fermionic Lagrangian as 



£ = iTj^re^M, + {^^a^'^ - ^7V™V + ^7'"^'^V) ete'^d^Cm. ■ (5.57) 



This can be expressed in terms of the following Lagrangian generators: 

A:r^(V', 60,61,62, eg; p,g) =7]^Vi(p)A^i(em,'0j;P,?) (5.58) 

/Cf '"(^, 60, 61, 62, 63; ri, r2, ra, r4) = (5.59) 
- (^(ri)7"^a"V(ri) -^(ri)7V'"V(ri) +^(ri)7"^a«V(ri))x 

X {A:i,A:2,-A:3}(66,6„, 6^;ri,r2,r3,r4) (5.60) 

/Cfermionic = {/C^^'^ | {m, i, j} C {0, 1, 2, 3}}U 

U{/C2"^"^|{m,i,i}c {0,1,2,3} (5.61) 
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5.5 Lagrangian Generator for Type-2 Fermions 



Lagrangian for spinor field is given by 



L = ^Pj'^d^iP + V^7V'^V(e(;e^(a^e,, - d,e,^) + e^e^^^e^) . (5.62) 



As before, e^, ej,, Cc and are viewed as four separate vector fields, represented 
by four separate sets of holonomies. However, the information about orthogo- 
nality of these fields will be used, which might either be enforced through the 
separate type-1 Lagrangian generator for vierbeins talked about earlier or else 
through some Lagrange multiplier terms. 

We would like to compute separate type-2 Lagrangian generators for each 
of the key terms of the Lagrangian. 

Let's start with the Lagrange multiplier terms for vierbeins. Since these 
terms are structually similar to the scalar field Lagrangian, lets try to borrow 
our results from the latter in order to save ourselves some work. 

Prom the section on type-2 scalar fields, we know that their Lagrangian 
generator is {J^s, f,9) where 



/:(0,E,p,g) = A;dT2'^(p,g)(g^-p'^)(g'^-p'^)a^0a,0(A;d + 2Ed(/do + /dl(c^-l)))- 



J,(0,r,s)=(0(r)-0(s)f) 



(5.63) 



and corresponding pre-Lagrangian is 



2kaEahiT^''^\d-l)d^cl>d^<t> 



(5.64) 
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Replacing 0(r) — (f){s) with (r'* — s^)Vn it is easy to see that pre-Lagrangian 
corresponding to 

J(r, s) = v^{r, s) (5.65) 

is given by 

jC{<P, E,p, q) = kdT'"'{p, - - p^vM^d + 2E,{Iao + hi{d - 1)))- 

- 2kaEahiT^''^\d - 1)^% (5.66) 

While this is not necesserely true about actual Lagrangian, it is easy to see 
that pre-Lagrangian depends linearly on type-2 Lagrangian generator. Thus, 
by using 

u{r, s)v{r, s) — -{{u{r, s) + v{r, s))^ — u^{r, s) — v^(r, s)) (5.67) 

and 

^\{{^^^ + ^m) - - (5-68) 

the pre-Lagrangian corresponding to 

Jiu^v^r^s) — u{t^s)v{t^s) (5.69) 

with the same / and g as earlier is 

£(M,^;,E,p,g) = fcdr2'^(p,g)(g^-p^)(g--p>^^;,(A;d + 2Ed(/do + /di(ci-l)))- 

- 2kdEdhir'"'^\d - 1)m% (5.70) 

It is also easy to see that all coefficients are the same as the scalar case, so 
the solution for E gives us identical result as it did back then which, upon 
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substitution, gives us the same answer as for scalar case, if is replaced 

with U^^V^ and mass term is dropped: 

C = /'ff~ ^\y '^'U^V, (5.71) 

JdO + -'(ill," — i-) 

We would also hke to produce Uf^Ufj, + 1 and U'^Uf^ — 1. By inspecting the 
mass term of the Lagrangian generator for scalar field, while dropping both m 

and (po, it is easy to see that if Lagrangian generator is set to constant, 

J{r,s) = l (5.72) 

with / and g defined in the same way as before, the corresponding pre- 
Lagrangian is 

J(^const{E,p, q){l + E)kjr^''{p, q) (5.73) 

Again, since the value of E is identical to the one in scalar case, the actual 
Lagrangian is also identical to the mass term of scalar Lagrangian, where 0o 
and m are dropped: 

In order for the sum of the two Lagrangians to be proportional to V^Vdn + 1, 
Vd has to be defined as 



The + and — signs are taken into account by replacing J — 1 with J — —1. 

As far as ■07"e^9^V goes, we can factor out ■07" and write a Lagrangian 
generator for e^(?^0. We can do that by copying what we had for u^v^, substi- 
tuting u'^ for and d^%l) for This substitution amounts to replacing u(r, s) 
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with ea{r, s) and replacing v{r, s) with ip{s) —ip{r). Thus, the Lagrangian gen- 
erator for kinetic term with constant vierbeins is 

>Cflat,kin('0,r,s) = ip-f°'ea{r,s){ip{s) -ip{r)) . (5.76) 

Now let us move to type-2 Lagrangian generator for e'^eld„ecij, terms. 
The plan is the following: 

PART 1: By using orthonormality of e^, show that ej^ t}^ dv^c\x can be ex- 
pressed as linear combination of the terms of the form £^^^71 = ^i^ {^ii^nv — 
Qv^-np)- This would simplify the situation tremendously since the latter some- 
what resembles gauge theory which was already done. 

PART 2: Find type-2 Lagrangian generator for E^hc in a coordinate- free 
setting of causal set. Even though, as remarked above, the resemblence to 
guage theory should make it easy, there are still difference with gauge theory, 
including the fact that there are 3 holonomies rather than 1, which makes it 
somewhat difficult. But orthonormality of these three holonomies will be used 
in passing through this. 

PART 1 

If the above expression is expanded and the dummy indices // and v are 

switched on the second term, this gives 

Eahc = f^a^ f^h d^e^^ - ea" d^c^u . (5.77) 
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Apply dfj_{ea^ ecu) ~ ^i^Vab = to the second term of above equation gives 

Eabc = Cb'^ 9^ec^ + eb'^^^e^^ . (5.78) 

Permuting the indices gives 

Ebca = d^eau + e/ e/d^eb, (5.79) 

Ecab = e/ d^Cb^. + e^^ e/^^e^^ . (5.80) 

Prom these expressions it is easy to see that 

Eabc + Ebca — Ecab — 2 Cc" tb^ ^ix^a ■ (5.81) 

Switching a and c and dividing the expression by 2 gives 

eb^ a^e^ = ^ (£;e6a + Ebac - Eacb) ■ (5.82) 

Thus, the problem of computing Ca'^ eb^ ^ii^c reduces to the problem of com- 
puting Ecba for causal set, as desired. 

PART 2 

Try Lagrangian generator (JT", /, g) where 

J{u^ V, w) = u{t, r)v{t, s){w{r, s) + w{s, t) + w{t, r)) (5.83) 

/(n, ^2, rg, r4, rg) = (ri, rg, r4) , g'(ri, r2, rg, r4, rg) = (rg, r4, r*) (5.84) 
The pre- Lagrangian corresponding to this generator is given by 

C{u,v, w, E, p,q)^ J d'^rd'^sdH{Jip, r, s) + J{q, r, s) + 2EJ{r, s,t)) ^ 

^(^J dH^ J d'^rd'^s{J{p,r,s) + J{q,r,s)) + 2E J d'^rd'^sdHj{r,s,t) = 
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= kdT'^ J d'^rd'^s{J{p, r, s) + J{q, r, s)) + J d'^rdVtJ{r, s, t) (5.85) 

In the linear case that satisfies time reversal symmetry, the pre-Lagrangian 

becomes 

jC{u, v,w,E, p, q) — (5.86) 

= 2 (^kdT'^ j d^rd^sJ{p, r,s) + E J d^rd^sdHj{r, s, tj^ 

Let C/^, and be vector fields corresponding to u, v and w . As usual, 
set a coordinate system in which t axis passes through p and q, which are the 
end points of Alexandrov set a{p, q). In this coordinate system, 

p = (-r/2, 0, 0, 0) , g = (r/2, 0, 0, 0) (5.87) 
From intuition we have from electrodynamics, 

w{r,s) + w{s,t)+w{t,r) = {r'' - t^){s'' - t''){d^W, - d,W^) (5.88) 
Thus, in the above coordinate system, the first integral is given by 
d'^rd'^su{p, r)v{p, s){w{p, r) + w{r, s) + w{s,p)) — 

+ ^^S'As^s^ + "^WK) (5.89) 

By antisymmetry of d^jWi, — d^Wfj,, /i and u can never be equal in terms that 
survive the cancellation. Thus, in the second term inside the integral, the fact 
that ijl — {as inferred by Sq) implies that u — k ior some A; > 1. In order for 
that term to survive integration, the indices of two r-s have to match which 
means that p — /i — k. 
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a(j>,q) 



As far as the third term is concerned, the situation is the opposite: since 
/X = 0, we set u = k. In order for indices of s-s to match, it means that u = k 
as weU. Finally, as far as the last term is concerned, due to 5q5q, the coefficient 
outside the integral becomes doWQ — doWQ — which means that the last term 
is dropped, out. 

Thus, the integral becomes 



/ 



d rd su{p, r)v{p, s){w(p, r) + w(r, s) + w(s,p)) — (5.90) 

9) 

= r'^+^UoVkidoWk - dkWo)hohi + UkVo{dkWo - doWk)him+ 
After some simple algebra this becomes 



f d" 

J a(p,q) 



rd'^su{p, r)v{p, s){w{p, r) + w{r, s) + w{s,p)) ~ 

= T^^^^hiiho + ^){doWk - dkWo){UoVk - UkVo) (5.91) 
Now lets move to the other integral, 

J d'^rd'^sdHu{r, s)v{r, t){w{r, s) + w{r, t) + w{t, r)) = (5.92) 

= UpVaid^W^ - d^W/,) J d'^rd'^sdHis" - r''){r - r")(s'' - r''){t'' - r"") 
Let's expand out {s^ — r^){t^ — r^). It is easy to see that 

ri^r^id^W, - d,W^) (5.93) 

which means that r^r*^ term can be dropped. Thus, the integral becomes 

))^ (5.94) 
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Upon further expanding all of the parentheses, only the even terms survive 
integration. Thus, the integral becomes 

= U,V„{d^W, - d,W,) [( j rf'^r) ( j d'^ssPs'^) ( j dHrf^+ (5.95) 



+ ( / d'^rr^r'']i I d'^ss''s>']i I d'H] + ( I dV^r^)f / d'^s]i I d'Hrt" 



Upon inspection of the above expression, in order for the terms to be even, 
we need p — and a — i/. Upon substitution of these and doing some simple 
algebra, the integral becomes 

d'^rd'^sdHu{r, s)v{r, t) (w(r, s) + w{r, t) + w{t, r)) = 

= Skdhdr""^^ [lod ^(C/ol4 - UMidoWk - dkWo)+ (5.96) 

k 

Combining the two integrals tells us that the pre-Lagrangian is given by 



jC{u,v,w,E,p,q) [2(^{l+3E)I^+^)(doWk-dkWo)(UoVk-UkVo) 



+ 



(5.97) 



In order to minimize variation of pre-Lagrangian we have to, as usual, adjust 
E in such a way that non-covariant contribution cancels. This means 



(5.98) 
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which gives 

T -I- hd. 

Ea = - 3 ) ■ (5-99) 

Substituting the above E^, into pre-Lagrangian gives the Lagrangian 
h (I -\- — ) 

^ ^ — d diK do T) (^jjf.y. _ u-v''){df,W, - d^Wf,) . (5.100) 

-'do + Idl 

Unhke the bosonic cases, we can not scale U, V and W to adjust the coefficient 
because these fields, being a vierbein, need to have norm 1. Instead, we take 
advantage of the fact that the above terms are all multiphed by t/j^i^t/; and 
scale ip. 

In light of the fact that ip is Grassmannian, while scaling -0, we have to 
take into account the ^ measure that ip is subject to. From the case of one 
Grassmann variable we know that, after the re-scaling 

0a (5.101) 

9d = (5.102) 
UO) = CjaCde) (5.103) 



we still have 



(5.104) 



and 



ed-ed = i (5.105) 

This means that we can proceed with re-scaling if) in the same way as we did 
for bosonic case. 
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However, the situation is complicated by the fact that there is also ■07°e^9^V 
term, for which re-scaling is very different. This is adressed by using sim- 
ilar idea to type-1 case where total Lagrangian generator is represented as 
J — {J'l, J^2, ■■■) as opposed to — J^i + ^2 + ■■■ and separate these two terms 
as separate terms in Lagrangian generator. 

In this case, for the fermion will depend on whether we view mass 
term as independent one or whether we combine it with one of these other 
terms. Whatever is the case, if ip is replaced with ipd = ip/Cd, then m^ipip is 
replaced with C^m^ipdj/jd. Since, when it comes to mass term, both in scalar 
case and in fermionic case the field is considered to be a constant, we can copy 
the answer from fermionic case with appropriate adjustments, which gives us 



CV*;^..^-(l-^^— ^1— ^) (5.107) 



2{Ido + Idi{d-l)). 



which implies 



™. = m*.r^C.Jl-^(^— |L_ (5.108) 



where Cd can have different values depending on whether mass term is com- 
binded with the derivative terms of ip or with derivative terms of vierbeins or 
whether it is viewed as independent from both. 
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5.6 Appendix: Geometrical Interpretation of Non-Grassmanian 
Fermions 

In section 5.3 it was mentioned that if fermions were commutting, it would 
have been possible to replace 6 of the 8 fermionic real degrees of freedom 
with the degrees of freedom associated with vierbeins by pointwise selecting 
reference frames in such a way that spin-down particle as well as spin-down 
antiparticle components of fermionic field are point by point. 

What prevented us from doing it was the fact that fermionic field anticom- 
mutes while vierbein field commutes. Prom a different angle, the Grassman- 
nian fermionic field is subject to ^ measure, while vierbeins are not. However, 
it would be of mathematical interest to explore a toy model in which fermions 
commute and thus the above mentioned concerns do not apply. This will be 
the subject of this chapter. Since this toy model does not apply in real life, 
this chapter is for mathematical interest only and can be skipped as far as the 
rest of the thesis is concerned. 

If we have an arbitrary spinor at a point, we can always rotate it into a 
state of the form Xp'^i + Xa^i (here "p" stands for particle, and "a" stands for 
antiparticle). This can be seen by counting degrees of freedom. The rotation 
group in 4 dimensions has 6 degrees of freedom, while multiplication by an 
arbitrary complex scalar adds 2 degrees of freedom. This means that if the 
actions of these two groups were independent, we would obtain a total of 8 
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real degrees of freedom, which matches the number of real degrees of freedom 
in a 4-spinor. 

In light of the above, I would like to get rid of the notion of fermion in 
favor of more "geometrical" quantities, which are: 

(1) 6 orthonormal vierbeins which define local frame in which spinor has a 
form Xp'^i + Xa'Vi, where both Xp and Xa are real. 

(2) Xp and Xa (see above). 

Thus, I will rotate the reference frame from point to point in such a way that 
the fermionic field is always in the desired form. 

In order to stress the fact that vierbeins are now viewed as fields, I will 
replace Bq, e^, 63 and 63 by A^^, B^, and D^^ respecively, and introduce 
Lagrange multipliers to assure that 

= 1 , B'^B^ = C'C^ = D^D^ = -1 , (5.109) 
A^B^ = A'^C^ = A'^D^ = Bf'C^ = B'^D^ = C'D^ = (5.110) 

(I am using a metric of signature (+,—,—,—)). We will then relax the assump- 
tion about rotation of reference frames and go back to the flat Minkowski case. 
Thus, the final form for my notation for a spinor will be {A^^, B^^, C, D^^, (f>, x). 

The geometrical model I propose has also an intuitive appeal: if we take 
the word "spin" literally and imagine a particle spinning, we would need to 
know the plane in which the particle spins. This gives us two axes, which are 
described by two vectors, B^^ and C^. Now, since spin is subject to Lorentz 
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transformations, we also need to know the rest frame of the particle, and this 
is determined by a timelike vector A^^. As far as is concerned, due to the 
orthogonality and unit-norm conditions, it is completely determined by the 
above 3 vectors. As you will see from the results of the "Lagrangian" section, 
things can indeed be visualized in terms of spinning. 

The biggest objection one can have is that vector fields and fermions 
have different transformation properties. However, if one realizes that the 
transformation properties are completely determined by the Lagrangians and 
inner products, we can cure the problem by drawing attention towards the 
latter two, and away from the transformation properties. For example, the 
implication of spin-| is that a 360° rotation in vector space is the same as a 
180° rotation in spinor space. This problem can be cured by redefining what 
we mean by a rotation: Instead of simply using U h- > M{9) U, where M{9) 
is the usual rotation matrix, we use U i— > e'^/^ M(9) U ; by adding a phase, 
the complex amplitude switches sign upon a rotation by 6* = 360°, despite the 
fact that the vectors are rotated back to their original positions. The reason 
for this feature is that SU(2) is not the full symmetry group; rather, the full 
symmetry group is SU(2) x U(l). This gives us the freedom of selecting a 
subgroup R of SU(2) x U(l) such that R x U(l) = SU(2) x U(l). Any such R 
can be used as a definition of rotation group, and the freedom of choosing this 
R corresponds to a freedom in defining the value of the spin: spin-| x spin-0 
= spin-1 X spin-0. Another example: suppose we perform a 180° rotation in 
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the space of vectors. In this case, the fact that vectors determine a coordinate 
system doesn't stop us from defining the inner product between two flipped 
coordinate systems to be instead of —1. After all, we can define the inner 
product any way we like, so we chose to do it this way. These two features 
will be implemented in the remainder of the section. 

Throughout this section I will use the following representation: 



] 

V 



1 
-1^ 

where 1 is the 2x2 unit matrix, a'' for k = 1,2,3 are the Pauli matrices, and 
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the basis column state vectors will be defined as follows: 
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Even though in this section we are only dealing with a toy model in which there 
are no Grassmann numbers, we are still free to get rid of and terms of the 
Lagrangian. This means that as far as spin connection terms are concerned, we 
are looking only at XvXa terms. Based on the fact that spinors take the above 
form, it is apparent that the only term of Lij^ah'^^^a°^i\) — Umab'^^l^l^o-"'^'^ 
that survives is the one where ^^^"^a"^ is off-diagonal matrix in the 2x2 
block representation. This will happen only if m, a and b are all non-zero, 
which identifies them as 1, 2 and 3 up to permutations, which means they are 
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all proportioanl to ip^^^ip — 2xpXa- As far as derivative terms, we do have 
to keep ippdipp and ipadipa terms as well as we still have to keep the "mixed" 
ones. This means that we are looking both at the diagonal and off diagonal 
matrices in block diagram. However, since there are no spin down components 
of either particle or antiparticle, each block needs to be diagonal. The matrices 
that satisfy these constraints are 7° and 7^ . 7° will give us t/j^'^e^^dnt/j — 
e^^iXpdnXp+XadmuXa) and 7^ will give us ip'^^e^^'d^ip = e^^ixpdixXp-XadmuXa) 
Thus, the Lagrangian becomes 

Cfree = fc^^7'^('^23 ' '^is + ^'12) + ?e°'^7o5M^ + i^e^^l^d^i^ (5.111) 
= 2A;XpXa(^23 - ^13 + ^12) + ^^''{XpdnXp + Xa^^iXa) + e^^{xpd,,Xa - Xad^Xp) 

Finally, in order to stress the fact that vierbeins are viewed as fields, we 
will replace through e^^ with A^^ through D^^ respectively, and introduce 
Lagrange multipliers to enforce orthonormality. We will also replace (^23 with 
cuqjj and do similarly with all the other indeces. These a;-s are now functions of 
our vector fields that are defined based on formal substitution of these in place 
of Vierbeins without making an assumption of orthonormality, since the latter 
is only a consequence of Lagrange multipliers. Thus, Lagrangian becomes 

Cfree = '^kXpXai^CD " ^BD + ^Bc) + ^^{Xp^^,Xp + Xad/^Xa) + D^'iXpdf.Xa - Xad^Xp) 

+ Ai (AM^ - 1) + A2 {B'^B^ + 1) + A3 (C^C^ + 1) + A4 {D'^D^ + 1) 

+ As A^^B^ + Ae A^C^ + A7 A^D^ + Ag B^C^ + Ag S'^D^ + Aio C^D^ , (5.112) 
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where 

uj'^w = U^'Vid^W, - d,W^) + VWd^Up (5.113) 

Now I would like to introduce interaction terms into Lagrangian. Since it is 
possible that we have interaction of more than one fermion, I would like to be 
able to define and A/j^^'f'^i/j. In general, this means I would like to define 
^A^ . Suppose vierbeins that are needed to put ^ in the form Xpi^i + XaVi are 
Cq = A^^ through 63 = D'^ while vierbeins that are needed to put -0 in the 
form rjpUi + rjaVi are /q = E^^ through /g = if^ 

Now, suppose the transformation from the e-basis to /-basis, e^^f, lies in 
the connected component of identity matrix. In other words, they are either 
both forward- moving or both backward-moving. In either case, they are both 
forward-moving relative to each other. This means that we can write e~^f — 
exp(ln(e~^/)). Thus, ln(e~^/) can be viewed as generated by infinitesimal 
transformations. The infinitesimal spinor transformation that corresponds to 
ln(e~^/) is —j{\a{e~^f))ni, a^", where a^" — ^[7'*, 7'']. Now, by exponentiating 
it back, we will get the finite spinor transformation corresponding to the trans- 
formation between these two coordinate systems: exp{ — ^(ln(e~^/))^,y cr'^'^}. 
Thus, 

^A^ = {Xp{ui\+Xa{vi\)AeM-iiHe-'f)),u<JniVp\ui)+Va\vi)) (5.114) 

Now in the case where one reference frame is forward-moving and the other 
one is backward-moving, all we have to do is insert a time-reversal operator 
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inside the log, namely exp{ — ^(ln(T'L ^M))^,^cr''^}, where T is time reversal. 
This gives us 

IK^ = (xp(Mi|+Xa(^^i|)ATexp{-^(ln(e-V))^,a'^'^}(r;p|«i) + r;J^;i)) (5.115) 

From the point of view of mathematical rigor, the equation we just got is as far 
as we can get. However, it would be fruitful to note that there is an intuitive 
correlation between that equation and the concept of "spinning", which the 
word "spin" represents. This discussion is not rigorous, and can be skipped 
by mathematically minded readers. 

In order to visualize the "spinning" that goes on, one can replace in- 
finitesimal points in spacetime by small arrows. However, unlike the way it 
is normally done, these arrows will not be aligned with a spin axis. Instead, 
these arrows will be, themselves, spinning around some other axis. Thus, spin 
around the z axis can be visualized as a vector pointing in the x direction 
whose end is moving in the y direction. Now, due to the fact that spin is 
subject to Lorentz transformations, we also need to know the reference frame, 
and it is given by Ai^. And finally Di^ is a cross product of with in a 
reference frame in which the particle is at rest. Thus is what is usually 
thought of as the direction of spin. 

Now let's look at each term in the Lagrangian to see what it represents. 
First, consider the C^A^d^B^ term. In the reference frame of the point around 
which the arrow spins, A'^ = 5^o is just a vector pointing along the t axis. Thus, 
in this reference frame, the term becomes C^dQE^. Now, if we visualize as 
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pointing along the x axis and along the y axis, this expression reads off as 
"how fast does the end of the x axis move in the y direction" ? Note that the 
end of the x axis can also move in the z direction, but this speed would simply 
have no contribution to the Lagrangian. In other words, the way to think of it 
is this: A^, and are vector fields with a weird coupling between them. 
They are coupled to each other in such a way that the end of vector B'^ is 
"forced" to move in the direction of the vector by the Lagrangian, which 
would then be interpreted by the observer as a rotation around D^. 

Now let's look at the A^'B''^^C^ and A'^B^'d^C^ terms. One difference be- 
tween the first term and these two terms is that in the "directional derivative" 
part of the equation (which in the first term is A^dv) A^ is being replaced by 
B^ and C^, respectively. This means that the differentiation is no longer in 
the time direction, but rather in a spatial direction. The interpretation of this 
might be that, as opposed to speaking of the rotation of one arrow (in which 
case it travels along the y direction) we are comparing the motions of different 
arrows (and the spatial direction differentiates the arrows we are looking at). 
The other difference is that while the first term refers to angular motion, both 
of the other two terms refer to linear motion: A'^B'^di^C^ refers to "boosting" 
of the spatial axis in the time direction A^ (which would be proportional to 
the negative velocity in the direction), while B^Cdi^A^ refers to "boost- 
ing" of the time axis A^ in the spatial direction 5^ (which can be interpreted 
as a positive velocity in the direction). Thus, by identifying B'^ with the 
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X axis and with the y axis, the Lagrangian tells us that we "want" points 
that are located away along the x direction to move in the negative y direction, 
and we want points located away along the y direction to move in the positive 
X direction. This is equivalent to saying that we want to have a "planetary 
system" and we want the orbits of the planets to have spin in the —z direction. 

This can be summarized as follows: we can envision space to be con- 
structed of mini-atoms. The first term in our Lagrangian tells us about the 
spin of each electron in an atom, while the last two terms tell us about the or- 
bital rotation of electrons. However, these atoms are "glued together" so that 
the orbital rotation of one atom gets "passed" onto neighboring ones, which 
is why this actually looks like a derivative globally. At the same time, while 
there is "cohesion" between atoms in the second two terms, the first term has 
no such thing: the arrow is infinitesimal and doesn't extend to a neighboring 
point. Similarly, when we talk about linear motion in the last two terms, this 
linear motion is really a similar arrow pointing in the t direction, which is also 
infinitesimal. Thus, while there is spatial cohesion in the second two terms, 
there is no time cohesion. 
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6 Chapter 6: Conclusion 



A causal set is a locally finite set of points, the only reliable structure for which 
are causal relations. The assumptions normally made for ordinary physics, 
such as manifoldlike-ness and symmetries, no longer apply to causal sets. This 
allows the causal set to have more general, non-manifoldlike, structure, while 
the manifoldlike structure arising on larger scale is a possible result of more 
general, non-manifold-based, theory. 

Causal set theory appeals to observation made by Hawking and Malevent 
that a manifold structure can be completely identified based on the scaling 
together with the causal relations between its events. In discrete case, the 
scahng is identified as simple count of points, thus causal relations are viewed 
as the only defining factor for gravitational field. Strictly speaking, causal 
relations are defined as a partial order. 

Scalar field on a causal set are defined as one point real or complex val- 
ued functions; Vector fields on a causal set are defined as a two-point real 
valued function. Grassmann numbers are defined as an individual elements 
of a vector space equipped with commuting dot-product and anticommuting 
wedge- product, and spinor field is defined as a combination of vector field, 
corresponding to vierbeins, and Grassmann-valued one point functions. 

Two classes of fields are postulated: type 1 and type 2. For every type-1 
field, a Lagrangain generator is defined as a function of the behavior of that 
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field together with a choice of few "representative" points. It turns out that if 
Alexandrov set is selected in such a way that would minimize the fluctuations 
of Lagrangian generator for the choices of points varying within its interior, 
this would result in Lorentz covariant expression. 

For type-2 fields, on the other hand, Lagrangian corresponding to a given 
Lagrangian generator is a linear combination of two different integrals of that 
generator; the difference in the integrals is due to the difference in the choice 
of parameters integration is performed. Arbitrary constant is introduced as 
a multiplying factor of one of these integrals, and minimization is performed 
mainly with respect to that constant; the minimization with respect to the 
choices of Alexandrov set is unimportant in linear cases, but it is retained 
to handle non-linear situations and specifically to adress lightcone singularity 
that might arise. 

Whether the field is type 1 or type 2, the final expression can be thought 
of as a relativistic covariant generalization of non-covariant expression used 
for Lagrangain generator. If Lagrangian generator is properly chosen, this 
will typically produce Lagrangians we are used to. Since in standard physics 
Lagrangians, too, are introduced as relativistic generalizations of non-covariant 
expressions, what was done for causal sets is simply a non-manifold equivalent 
of what we already used to. 

Relavitistic covariance is obvious from the construction: the only topoogy 
used is causal relations which is inherently covariant. Thus, it is shown that 
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if any field is viewed as type 1, we do end up with covariant expression. The 
purpose of introducing type-2 fields is mainly to handle unwanted terms that 
might arise in type-1 cases, although these terms are still very much covariant. 

For example, in the case of a type-1 scalar field, possibly appears 

with different coefficients, depending whether it is positive or negative at a 
given point. For a type-1 gauge field, an unwanted term Sap-ysF'^l^F'^^ came 
along, and for type-1 gravity, unwanted contractions, such as R^^Rap appear. 

However, in each of the fields except for gravity, this is dealt with by 
a simple addition or subtractions of different terms, which makes gravity a 
prime reason for introducing the notion of type-2 fields. However, once the 
notion of type-2 fields exists anyway, we might as well apply it to other, non- 
gravitational, fields. The prime motivation for doing the latter is to be able 
to postulate SU(n) symmetry for gauge fields. 

For type-1 electroweak field, we are forced to view electromagnetic and 
weak interactions as completely separate, and the appearance of symmetry is 
only a result, rather than a cause, of similarity of Lagrangians. To make it 
worse, symmetry is only approximate rather than exact in discrete cases. On 
the other hand, if electroweak field is viewed as type 2, the symmetry can be 
formally postulated on a fundamental level and, consequently, it is exact. 

Finally, in an Appendix that follows, a model of relativistically-covariant 
collapse of the behaviors of the fields, including gravity, was introduced. It 
is assumed that the existing, non-fluctuating, fields co-exist with the other 
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versions of these same fields fluctuating within a "corridor" around the non- 
fluctuation configurations. Thus, the probability of each non-fluctuating con- 
figuration is simply a path integral of the fields fluctuating inside of the cor- 
responding corridor. While this model has an advantage compared to de- 
coherence in a sense that it is manifestly covariant, it was argued that in 
non-relativistic case it reproduces the key predictions of decoherence model, 
including entropy being a defining feature of measuring apparatus. 

That model allows one to "collapse" causal relations into a specific con- 
figuration, despite the fact that they, being identified with gravitational field, 
undergo quantum fluctuations. This allows for the topological background 
necessary to be able to introduce propagators. However, while this adresses 
the issue of topological background, it does not adress the issue of it being 
manifold-like. It turns out that a separate Lagrangian generator needs to be 
introduced for that purpose alone (see sec 5.8). There is a qualitative ar- 
gument that causal set, subject to Lagrangian generator of sec. 5.8 is more 
manifold-like than it would have been otherwise, but it is not clear whether or 
not it is sufficient to make it manifold-like. This is a subject of future research. 

As we have seen in this dissertation, a fundamentally different way of 
viewing causal sets was presented. Instead of statistically generating it as is 
traditionally done, it was claimed that a set of Lagrangians should be a starting 
point of generating a causal set. This amounts to saying that geometry is no 
longer a background of Lagrangians but rather the set of Lagrangians is a cause 
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of geometry. This is in line with a traditional picture of general relativity that 
metric is a gravitational field which is just as much of a field as any other field. 

However, the generating Lagrangians that were used, especially the fermionic 
one, might look "manufactured" and one might have a question as to why not 
use some other Lagrangians? There are two answers to this argument. The 
first answer is that the same question can be asked for standard theory as well. 
For example, in case of Einstein-Hilbert action, we can ask why do we have R 
instead of in the Lagrangian. 

The fermionic Lagrangian serves even better example to illustrate the 
point. While it is true that the Dirac equation was originally invented as a 
square root of Klein-Gordon equation, this is not true for the Lagrangians. 
So, in a desperate attempt for something to "look" like a square root on a 
classical limit, they had to cleverly "manufacture" a Lagrangian that would 
produce that. So, if they are allowed to do that, why can't we? In particular, 
we might as well be even more clever and manufacture a Lagrangian generator 
for a fermionic field, so that, after a few-step process of first getting actual 
Lagrangian, then applying it to a manifold, and finally taking classical limit, 
we will get "square root" at the end of the day. 

Of course, however, the argument "if everyone can do it badly so can I" 
is not a good one. Especially if I am trying to come up with a fundamental 
theory. Thus, this is not an official standpoint of the theory, and I consider 
the issue to still be unresolved. The only reason for the above argument is to 

176 



explain why my theory "has a right to hve" . But the fact that the theory has 
a right to hve does not imply that all of its problems are resolved. In fact, to 
this day, none of the theories can make that claim. 

Another argument in defense of that theory is that, in my opinion, the 
above weakness is not as serious as the foundational problems of physics the 
theory proposes to solve. After all, if we were offered an explanation of all of 
our experiments in terms of Newton-based classical physics alone, few would 
object, even if it meant that we were to introduce some complicated-looking 
forces that were not originally present in Newtonian physics. 

Of course, the response to this argument is that this theory, as it is, does 
not claim to "explain all experiments on modern physics". On the contrary, 
it's only hope is to reproduce some of the most basic things that are otherwise 
taken for granted: Lagrangians as well as basic geometry. Even then, the 
manifold-like structure that is being predicted in section 5.2 is approximate at 
best. However, in light of the fact that this area of physics is very young, it 
is hoped that situation will change in future when more research is done. In 
this light, this dissertation should be viewed as a necessary starting point for 
that future work. 

It should also be understood that there are technical obstacles in develop- 
ping this theory to the point of making predictions. In case of square lattice, 
it is possible to systematically perform path integration over arbitrary many 
degrees of freedom since the entire information can be captured in the defi- 
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nition of a lattice. In case of causal set, in light of the randomness of causal 
structure, this is no longer true. Thus, if we were to perform path integral, 
we would have to do so numerically rather than analytically and we have to 
separately count each degree of freedom. 

In my visit to Perimeter Institute, Sorkin told me that if my causal set 
consist of 100 - 1000 points, it would take few years for the modern computer 
to finish calculations with the Lagrangains that I proposed. However, my 
hope is to come closer to be able to replace the numeric methods with analytic 
ones. Of course, actually doing that is simply unrealistic. But it is possible to 
do something in the spirit of perturbation theory: assume that somehow we 
know the value of Z{-<o, JFq) and try to calculate Z(-<, J-") provided that T is 
sufficiently close to J^o and -< is sufficiently close to ^o- 

On the other extreme we can do something else in parallel: instead of 
using small-number-simplifications characteristic of perturbation theory, I can 
use large-number-simplifications characteristic of classical limit. This would 
predict behavior of classical objects which would serve as a potential test of the 
theory. Furthermore, by applying it to black holes, we can have a conceptual 
framework to try to tackle questions such as information paradox. 

It should also be remembered that even classical Einstein's equation is 
only solvable analytically for the most simple situations, such as spherical mass 
distribution. This, however, does not disqualify classical general relativity from 
being a valid theory. So, even if I would not be able to make any predictions 
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beyond black holes, I can claim that black holes to causal set theory is the 
same as spherical mass distribution to general relativity. This might be the 
ultimate defense of causal set theory against such objections. 
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A Appendix: Interpretation of Quantum Me- 
chanics on Causal Sets 



A.l Quantum Gravity and the Need for an Interpreta- 
tion of Quantum Mechanics 

By viewing causal relations as a quantum field, one is forced to admit that 
causal relations undergo quantum fluctuations. If the concept of quantum 
fluctuations is left unaltered, causal relations would be forced to undergo all 
possible structures, which means that there is no appriori causal relation. This 
leads to absurd situation: without any apriori topology, any pair of points is 
apriori just as close to or just as far from each other as any other pair of 
points, which means that the propagators between any pair of points should 
be identical! 

The solution to this problem can be accomphshed as a special case of 
a solution to a more general problem, the one of interpretation of quantum 
mechanics. Essentially, a successful solution to the latter is a theory that 
"localizes" fluctuating quantities. Thus, if successfully done, its application to 
gravitational field will provide the necessary background topology to introduce 
propagators. 

Before proceeding, it is important to ask a question whether or not it 
is possible to "temporary ignore" the problem of interpritation of quantum 
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mechanics the way it is done in standard non-gravitational quantum field the- 
ory. In flat space quantum fleld theory, "sources" and "sinks" designate the 
presence of classical objects that cause the collapse of wave function. These 
classical objects are localized in spacetime, which means that there is some 
unknown mechanism to allow such localization to occur. 

It is possible to deliberately overlook the question of localization of sources/sinks 
by simply saying, for now, that all the sources/sinks are color blue while all 
particles are color red, and only color red is fluctuating. Similar thing can 
be tried for spacetime by picturing that spacetime, being color blue, is non- 
fluctuating. But then it is no longer possible to derive Einstein's equation 
from variation of the action, since the latter is nothing but a classical limit of 
path integration, although one can attempt to go around this issue by simply 
imposing approximate validity of Einstein's equation as a constraint (this is 
done in the last section of the Appendix) 

This, however, in itself will be "more" than the "formal" prescription 
would give. Thus, since we have to do that anyway, nothing stops us from 
saying that we "don't like" imposing approximate validity of Einstein's equa- 
tion by hand and, instead, we "like" a lot more the idea of interpretation of 
quantum mechanics way of "localizing" the gravitational fleld. This will be 
the mindset of most of the chapter. 
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A. 2 Quantum Corridors 

The standard way of interpritation of quantum mechanics is the one of deco- 
herence. However, this model is not acceptable for two different reasons. First 
of all, it viewes parallel universes as non-interfering components of a single 
wave function of one, large, universe. As such, they will interact with each 
other gravitationally, which means that they will not be trully parallel once 
gravity is introduced. Secondly, the definition of wave function of a universe 
is based on the concept of "simultaneous events" which violates relativity. 

For this reason, in this section a new model of interpritation of quantum 
mechanics will be introduced that avoids the above difficulties. Its comparison 
with decoherence model will be discussed in more detail in the next section. 
The idea is to replace the cross graining in spacelike hypersurface which vio- 
lates relativity with coarse graining in the set of spacetime histories which is 
manifestly covariant. 

The idea of doing that have been around for a while. For example, it was 
mentioned by Mensky in Chapters 5 and 6 of Ref [16j as well as by hartle in 
Chapter 8 of Ref [20]. Since I didn't have time to study these references in 
detail, I am not sure regarding the extend to which my approach differs from 
theirs and whether these differences are good or bad. Nevertheless, I chose 
to follow more closely Mensky than Hartle just because I understand it bet- 
ter, and I will borrow his terminology of "quantum corridors" and "quantum 
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tunnels" . 

Since my reading of Mensky was relatively superficial, it is not clear 
whether he was implying the same procedure as I was carrying out here. If his 
corridor was referring to a size of a lab or some other variable that depends 
on specific circumstances, I would disagree; on the other hand, if his corridor 
has a fixed size, given by a constant of nature, it would be exactly what I am 
trying to do here. 

It is also unclear just how far from this approach Hartle stands. The 
"words" part of his Chapter 8 seem to match closely with what I propose in this 
chapter and also the way I would paraphrase Mensky, although mathematics 
part is rather different. On the surface reading, his mathematics is a lot more 
complicated than mine. Potentially, this might mean that my approach is too 
simplistic and in need of further elaboration. Once again, this is something I 
am leaving for further research. 

Let's get down to business. One can argue that when quantum system is 
being measured, it is not literally being seen; rather, an observer sees an arrow 
on the measuring apparatus pointing in a certain direction. That arrow is a 
classical object. Thus, the ultimate goal of the theory is simply a prediction 
of behavior of classical objects, and nothing more. 

It is proposed that the entire classical history of the universe - both distant 
past, distant future, and everything in between - is a single outcome of a 
single measurement performed by a single observer living outside the space- 
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time "system" (or a causal set if you will). This is a "quantum field theory" 
version of measurement as opposed to "quantum mechanics" one. That is, 
fields are measured point by point. 

The degree of precision of the measurement is identical at each point of 
the universe, and has nothing to do with any labs that might and might not 
be present there. After all, lab itself is still part of a quantum system, which 
means that it is an outcome of the measurement as opposed to a setup. 

As far as setup is concerned, no spacetime event is apriori different from 
any other one, thus it will only make sense that the degree of measurement 
of the fields at each point is identical. That degree of precision can be viewed 
as fundamental constant of nature, just as fundamental as speed of light or 
charge of electron. 

A classical behavior of a system can be viewed as a collection of all possible 
quantum mechanical behaviors that are "compatible" with it, which will be 
denoted as C. The definition of "compatible" is equivalent to the degree of the 
above described universal measurement. The probability amplitude associated 
with C is given by 



Now, the only "acceptable" choices of C are the ones for which any two of its 
elements are "approximately equal" to each other, with respect to approxima- 
tions done up to smallest "classical" scales. This, of course, required definition 
of approximation. 




(A.l) 
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In case of a scalar field, 0i ^ 02 means that 0i ^ 02 as long as |0i(x) — 
4>2{x)\ < e for every single value of x. It is then easy to see that there is one 
to one correspondence that associates every choice of C with the average value 



Thus, a range of fields can be replaced with a single, localized, field 0, 
and the probability density associated with each of the versions of such field 
is given by 



where the integration will be replaced by a sum in a causal set. 

In the above expression, in light of the fact that the range is constrained 
to a certain width, there is one to one correspondence between a field 0o and 
the range of variations of 4>. Thus, the above will define the probability of 
fixed values of 0o as opposed to the one of a range of values of 0. The issues 
of consistency with decoherence theory will be adressed in the next section. 

There is another issue: different nearby fields 0o might resemble each other 
so well that the same probability can be counted many times which would skew 
the results. This can be adressed by replacing the definition of probability with 
rigid yes-or-no criteria of whether a given history is "allowed" . That criteria 
is coupled with a principle that there exist exactly one parallel universe to 
"realize" each of the "allowed" histories. The goal is for that to imply that 
the probability of a system falling into a history that looks a certain way is 
consistent with the probability predicted by the above equation. This can be 



of e C. 




(A.2) 
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done by postulating the following constraint: 

CONSTRAINT 1: A field 0o is allowed if and only if |Z(0(x) = Mx))\'^ > 
f{\Z{(j){x) = 0o(a;))P) where f{x) is a decimal expression of x starting from 
the 100-th digit. 

In above the constraint, the function / was used as random number gen- 
erator, and the probability of x > f{x) is proportional to x as long as x is 
between and 1 

In case of causal set, the value of 0o is discretized in order to remove 
potential conceptual difficulties by making the number of parallel universes 
finite. This amounts to imposing another constraint: 

CONSTRAINT 2: for any point p, 0o(p) is a factor of some small number S 
For vector fields similar trick is done: 



where f{x) is the decimal expression of x starting from the 100-th digit. 

CONSTRAINT 2: for any point p and g, v{p,q) is a factor of some small 
number 5. 

Now the key to localizing geometry is to do similar trick to gravitational 
field. This requires a definition of a corridor on a space of causal relations (i.e. 




(A.3) 



CONSTRAINT 1: A field v{p, q) is allowed if and only if 



Z{v{p,q) = Vo{p,q))\'^ > f{\Z{v{p,q) = Vo{p,q))\'^) , 
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gravitational fields). 

A corridor around causal relation ^ is a set of all causal relations -<* such 
that the respective gravitational fields defined in terps of -< and -<* approx- 
imate each other. This is defined as "neighborhood" of and denoted as 
n{-<, e, N) (where N is the a low bound on a distance scale imposed in order 
to avoid unwanted discrete effects) and G{T, -<) is replaced with H{T, e, N) 
where H{T, e, N) = G{T, -<) n n{^, e, N) 

In light of the fact that classical Einstein equation can be derived by means 
of variation of an action with respect to g^^,, ideally one would like to define 
n{~<) in terms of small variation of the same. However, imposing separate 
restrictions for each of the choice of /i and u is not a relativistically covariant 
procedure, which means it is not well defined for causal set. However, due 
to the fact that length of geodesic depends linearly on Qui, this becomes an 
easy replacement. Timelike geodesic is defined much simpler than spacelike 
geodesic is, so the former will be used. It has to be taken into account that, due 
to the fact that since -< and -<* are distinct, some pairs of points are related 
by one causal relation and not by the other, which means that only one of the 
two geodesic segments in question is present. The way to deal with this case is 
to notice that if gf^^ ~ g^^ then whenever the Lorentzian distances according 
to these two metrics differ in sign, they should both be close to 0. Thus, a 
constraint to impose is that if p ~< q AND their distance is large according 
to -<, then p -<* q. The constraint regarding comparable geodesic length 
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should also be imposed. This, too, requires the scenario where two points 
have large enough Lorentzian distance according to relevent partial orders, as 
this would reduce stochastic fluctuations in manifold-like scenario. These two 
constraints can be combined into one: ii p -< q and they are far enough from 
each other according to then p -<* q and their distance according to ^* will 
be comparable to the one according to -< : 

DEFINITION: Let ^ be a partial order on a causal set S. Let e be some small 
real number and let be an integer. Let -<* be some other partial ordering 
on S. Then -<* is an element of n^(^,e, A^") if for any points p ^ ri ^ ... ^ 
tn < q there are points si,...,sm satisfying p -< si -<...-< sm -< q where 
M > (1 — e)A'^. Likewise, for any points p -<* ri -<* ... -<* -< q there are 
points si, Sm satisfying p ~< si ~< ... ~< sm ~< q, where M > (1 — e)A^. 

The meaning of the subscript r in rir is that n is defined in terms of 
Lorentzian distances, r. It is also possible to define n in terms of volumes of 
Alexandrov set: 

DEFINITION: Let ^ be a partial order on a causal set S. Let e be some small 
real number and let A^ be an integer. Let -<* be some other partial ordering 
on S. Then -<* is an element of ny(^, e, A^) if whenever there are more than 
N choices of r satisfying p ^ r ^ q, there are also more than (1 — |)A^ choices 
of s satisfying p -<* s -<* q. Likewise, if there are more than A" choices of r 
satisfying p ^* r ^ q, there are also have more than (1 — |)A" choices of s 
satisfying p ~< s ~< q. 
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The reason | was used in the above definition instead of e is to make 
sure that geodesic length, as opposed to the volume, deviates by order of e 
since the former rather than the latter linearly depends on the metric. Similar 
conversion was not done for N since the latter is just intended to put some low 
bound on the number of points to avoid discreteness effects, which means that 
it doesn't make a physical difference whether it refers to lengths or volumes. 

Of course, this still doesn't adress the renormahzation issue. But since 
causal set theory can be done numerically, the theory is still well defined. 

A. 3 Quantum Corridors vs Decoherence: The Non- Gravitational 
Case 

In order for previous section to stand, it has to be argued that interpritation of 
quantum mechanics proposed in the previous section is consistent with more 
standard decoherence model. Of course, there is no exact match between the 
models - for one thing, decoherence model is non-relativistic while tunnel one 
is. However, it is still possible to show that the model at hand reproduces 
key predictions of decoherence model - in particular the fact that the defining 
difference between "measuring apparatus" and other systems is large entropy 
of the former. Since decoherence theory is non-relativistic, it is suffice to show 
that the two theories agree in non-relativistic case, which is what most of 
the argument will be devoted to. Meanwhile, it will be apparent that tunnel 
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theory is written in relativistic form, hence it is Lorentz invariant. 

According to the decoherence theory, the defining feature of quantum mea- 
surement hes in a very large number of degrees of freedom of measuring device. 
Take, for example, locahzation of a particle on the screen. The complete sys- 
tem, consisting with incident particle as well as all the particles that compose 
a screen, can be represented as a point in a phase space. If there are total 
of n particles, the phase space has 3n dimensions, which correspond to space 
dimensions of each of the particles. This system evolves in non-relativistic 
time t according to Schrodinger's equation, with evolution equation v{t). 

The screen can be viewed as a collection of many different small pieces 
glued together, numbered 1 through n. After the interaction of a particle with 
a screen occurs, the resulting state becomes 



where a state \vi{t)) is a component of a wave function corresponding to the 
possibility of interaction of a particle with a piece number i. 

In case of absolutely smooth screen, {vi{t)\vj{t)) 7^ as long as t ^ to, 
where to is the time that a particle collapses on the screen. However, in real 
situation the screen is NOT totally smooth; rather, it is a complex system 
involving a lot of different particles. Thus, in the real life 




(A.4) 




(A.5) 
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On the one hand, it is still true that 



{Vij\Vkl) 7^ . 



(A.6) 



However, due to the averaging up of all the different degrees of freedom. 



Physically this means that there is no well defined phase shift between the 
component of wave function reflected from pieces i and j due to the fact 
that there is too much variation of phases of Fourier components of signal 
emitted from each of these two pieces. Consequently, all interference terms 
cancel, which means that the law of addition of probability amplitudes become 
equivalent to the law of addition of probabilities themselves. Mathematically, 
this is expressed as 



{Vi{t)\vj{t) = ^ {v,{t) + vj{t)\vi{t)+Vj{t)) = {v,{t)\vi{t)) + {vM^jit)) ■ 

(A.8) 



Since both \v) and \vi) correspond to functions in a phase space rather than 
a position space, they take into account the behavior of all existing particles 
put together, which means that there is nothing "outside" of these functions 
to interact with them. What is normally viewed as interaction now becomes 
an evolution of free state. In light of this, the fact that and \vj{t)) 

stay non-overlapping means that they represent imaginary quantum systems 
evolving completely independently of each other, although sharing the same 
space. In other words, they act hke "parallel universes" . 




(A.7) 



k,l 
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On the first glance it might seem counterintuitive that the two peaks 
would never overlap with each other in the future. After all, two delta functions 
evolving according to Schrodinger's equation are predicted to overlap. In order 
to answer this, we have to remember that above argument was not made in 
real space, but in imaginary phase-space. In order to get a physical intuition of 
what actually happens, we have to "translate" that into a real space language. 

If we have a classical object, such as billiard ball, it has a basic trajectory 
plus the internal distribution of its molecules. Its representation in position 
space takes into account the former and ignores the latter, while its represe- 
nation in phase space takes both into account to the same extend. On the 
one hand, a billiard ball can eventually reach any space location, regardless of 
the hole it moved through. On the other hand, however, its complex internal 
structure allows it to "remember" which hole it passed. Thus, if it passed 
through a hole A, it will never be able to reach a state of "remembering" that 
it passed through a hole B and vise versa. 

Roughly speaking, a phase space of a system of particles is its location 
on the actual space plus its memory. All systems, whether simple or complex, 
are very likely to reach the same space locations at some future point after 
very different behaviors in the past. Thus, the lack of overlap in phase space 
is blamed entirely on memory. 

Parallel universes, being defined in phase space, take memory into account 
just as much as the do position. Thus, in order for a system of particles to 
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move into a different universe and then come back to the ear her one, it has to 
be able to "erase" aU of its memory. Thus, its abihty to do that is a key to 
whether it is classical or quantum mechanical. 

Electrons, being one-particle systems, have no memory. Thus, the in- 
evitable overlap in position space will imply overlap in phase space. This cor- 
responds to earlier mentioned observation that the two delta functions evolv- 
ing according to Schrodinger's equation are predicted to overlap. Speaking of 
"erasing the memory" comment, electrons "erase" their memories every sec- 
ond. Thus, they jump between parallel universes all the time, so, effectively, 
each of these parallel universes has all possible locations of each electron. 

Few particle systems do have some memory, but that memory is not very 
detailed. Thus, they can remember that they were in graduate school, but 
they won't remember whether it was in Montana State or University of North 
Texas. This means that they can still go through holes far away from each 
other and end up having identical memory, resulting in possible overlap in 
phase space. 

However, once the system reaches sufficient complexity, its memory is 
detailed enough to remember exact numbers of leafs on the trees and stones 
on the ground. As a result of the entropy of the environment, it is extremely 
unlikely for two different cities to match in all these details. Consequently, the 
probability for the system to pass through different holes and end up having 
the same memory is very small. This is equivalent to the earlier statement 

193 



that the components of wave function on phase space are non-overlapping. 

Now let's see whether this can be reproduced by quantum tunnels. Ac- 
cording to the tunnel model proposed in this paper, the tunnel is defined in 
terms of pointwise values of quantum field as opposed to the locations of each 
particle. 

This is done for a good reason: the reason quantum tunnel was intro- 
duced on the first place is that, unlike decoherence model, it is relativistically 
covariant. Implication of being relativistically covariant is the use of second 
quantization instead of the first. This means that the position degrees of free- 
dom corresponding to each particle are replaced with field degrees of freedom 
of each point. 

However, this leads to the following question: a billiard ball can not 
"shrink" to one point. Thus, no matter how large it is, each point will only 
have at most one of its electrons. The field of each electron is much smaller 
than the width of the tunnel, as evidenced by the fact that electrons are 
quantum mechanical and not classical. So, then, how can pointwise-defined 
tunnels distingush the behavior of billiard balls, or their memories, no matter 
how large they are? 

I propose to answer this question in the following way. While electrons, 
being fermions, can not occupy the same location, the photons, being bosons, 
can. Thus, in order to be able to claim that billiard ball, being "large" has field 
value greater than the tunnel width, we have to claim that the photons emitted 
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from the different particles that comprise it all met at the same location. 

Now, on average, this does not happen. If the photons are evenly dis- 
tributted throughout the billiard ball, then every point of the billiard ball will 
on average receive as many photons as it would have if it was one particle 
system. However, due to entropy, there are some random high peaks. 

While, of course, this is subject to further research, I hypothesize that the 
size of some of these peaks is larger than the width of the corridor. Thus, the 
tunnel can detect the magnitudes and location of these peaks, and nothing 
more. However, if the billiard ball is sufficiently complex, the distribution of 
these peaks will be complex as well. 

I claim that these peaks alone serve as a "code" for the memory of the 
billiard ball. If that ball passes through different locations in space, it might 
still have similar peaks, but these peaks might be slightly shifted as a result of 
interaction with an environment. Since, again, the measurement is based on 
second quantization rather than first, that tiny shift of the peaks is detected 
no matter how small it is. 

After all, no one measures position of each peak. Instead, the field strength 
is measured point-wise. Thus, an infinitesimal shift of a peak is equivalent to 
two totally independent events: its annihilation at point r and its creation at 
point r + e. Both events are on a scale larger than the width of a tunnel, hence 
both are detectable. 
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A.4 Including Fermions in the Picture 

As was mentioned in the previous section, our ability to measure the position 
of biUiard ball is due to the photons emitted by its different particles meeting 
at the same location. This is a consequence of photons being bosons since 
fermions are not allowed to occupy the same state. Thus, the implication of 
the theory is that bosonic fields are the only ones being measured, and we 
simply infer the behavior of fermions based on the bosons they emit. 

However, in this section, I will ignore the above argument and try to 
restrict fermions into a tunnel anyway just to see what happens. It will be 
shown that this would only lead to absurd results, which would further justify 
the proposed idea of measuring only bosons. 

In light of the fact that Grassmann variables are defined in a literal sense, 
one might ask the following question: is it possible to confine fermionic field 
to a corridor the way it was done with a bosonic one? Unfortunately, the 
answer to this is no. After all, if a corridor was narrower than the width of 
the ^ measure imposed on Grassmann space, then the integral of ^ would no 
longer give , which means that the predicted behavior would no longer be 
fermionic. 

It would not necesserely be bosonic either, since the corridor might be 
wide enough for SOME variation of ^ to occur, which would contradict the 
constant measure for bosonic case. Thus, the prediction of the theory would be 
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the presence of bosons, fermions, as well as in-between particles. Bosons would 
correspond to a situation of corridor being much narrower than the width of 
the ^ function, fermions would correspond to the corridor being much wider 
than the width of the ^ function, and in-between particles would correspond 
to the width of a corridor being a fraction of a width of ^ function, but that 
fraction is not too small. 

The advantage of this approach is that if there was a way to somehow 
exclude "in-between" particles, it would be possible to claim that both bosons 
and fermions have the same measure, namely ^ function, which might help with 
a unification arguments. It would also be possible to hypothesize that, in the 
similar way as the locations of centers of corridors are being excluded based on 
low probability, the same applies to their widths. Thus, spin statistics theorem 
implies that both narrow corridors for fermionic Lagrangian generators as well 
as wide corridors for bosonic Lagrangian generators give near-zero probability 
density. This would "select out" the corridors where spin statistics theorem 
works. 

However, in light of the fact that weight function is unnatural, it is ques- 
tionable whether imposing it on bosons for consistency sake is worth it. Be- 
sides, total consistency would not be achieved anyway since for bosons dot 
and wedge products characteristic of Grassmann integration will not be impli- 
mented. Finally, there is no natural way of excluding "mixed particles" that 
are neither bosonic nor fermionic. 
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Of course, one can still impose quantum corridors on fermions in order to 
be consistent with the setup of the theory. But, in order to avoid the above 
difficulties, one would have to restrict himselves only to the corridors that are 
much wider than non-zero region of ^ function. Thus, they would either contain 
all of ^ or none of it. In the former case, one would have regular fermionic 
behavior, but one would not be able to specify exactly into what part of the 
peak the system is localized. In the latter case, the integral will be which 
means that the whole scenario is ruled out by the constraint |Zp > 

However, it can be argued that being able to locahze bosons is enough. 
Consider, for example, a double-slit experiment. When an electron hits the 
screen, the observer doesn't literally see the electron. Rather, he sees light 
coming into his eyes that is emitted from the location where electron sup- 
posedly hit the screen. That light is electromagnetic radiation. Since the 
electromagnetic field is bosonic, it can be locahzed. Thus, the statement "it 
is not likely for electron to fly from point A to point B" can be replaced with 
the statement "it is not likely for light to first be emitted from point A and 
then be emitted from point B. After all, the main way this can happen is by 
electron to travel from point A to point B and the latter is unlikely" . 

In the language of Feynman diagrams this means that all of the external 
lines are only photons, while internal ones can be both photons and electrons. 
As a result of that, electrons have impact on photon-alone scattering process. 
Since measurement predicts only external lines rather than internal ones, quan- 
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turn corridors are only applied to photons. However, since Lagrangian includes 
internal lines as well, the electron part of the Lagrangian affects the probabil- 
ities of photon-based corridors. 

The localization of electron in space can be treated in the similar way. In 
the hypothetical situation, if fermions were localized directly, it would have 
been argued that the "initial configuration" of fermions had large enough en- 
tropy for decoherence to occur. On the other hand, in the real situation elec- 
trons are being replaced with the photons in above argument, which means 
that the claim of the theory is that the initial configuration of photons forces 
decoherence. Indeed, if initial configuration of photons indicates a specific 
structure of electrons, then it would be highly unprobable that it happens 
without such structure of electrons actually being there. This means that the 
main contribution to path integral arises from electrons actually forming that 
pattern. 

This means that integral will predict a decoherence picture similar to the 
one that would occur if there were indeed electrons spaced in that fashion. 
That decoherence implies that certain corridors of photons are much more 
probable than others. In fight of the fact that due to decoherence different 
space locations of electron will imply very different behaviors of a photon, these 
"more probable" corridors correspond to localization of electron in space. 
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A. 5 Manifold Structure: Revisited 



In section 5.2 it was discussed how introducing vierbein fields might increase 
the hkehhood of manifold-hke structure of a causal set. However, at the very 
beginning of the section, an important disclaimer was made: due to the fact 
that path integral is taken over all possible geometries, there is no such thing 
as making a prediction of geometry to approximate anything in particular, 
regardless of degrees of approximation. Rather, the only thing that was done 
in section 5.2 was to make an argument that manifold- like geometries have 
higher weight in a path integral. But, whether we are adding up all possible 
geometries, or only all possible manifold-like ones, the fact remains that there 
is no overall geometrical background. 

But now in this Appendix, the model of "quantum collapse" of geometry 
(or, equivalently, collapse of causal relations) was introduced. This means that, 
regardless of presence or absence of Lagrangian generator of sec 4.4, there will 
indeed be the fixed geometry. On the other hand, if the Lagrangian generator 
of sec 4.4 is not introduced, there is no reason to expect that fixed geometry 
to be manifold-like. The causal structure, which is identified as geometry, can 
collapse to virtually anything, including, for example, tree-like causal relations. 

Thus, introducing Lagrangian generator that would encourage geometry 
to be manifold-like, and introducing collapse mechanism to make geometry 
fixed are two very different parts of the same puzzle. If these two parts are 
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both successfully done, then putting them together would lead to a manifold- 
like background for propagators to be introduced. 

As far as this thesis is concerned, the proposed solutions to either of these 
pieces of puzzle. 

Since a manifold-like structure can not be introduced by means of cur- 
vature, vierbeins are used to fulfill that purpose, and they are viewed as addi- 
tional fields. Lagrangian generator is given by 



where n is a very large number. It should be understood that a, b, c and d are 
viewed as fields rather than coordinates. 

Due to the fact that n is very large, it is clear that if that fraction is outside 
of a very small neighborhood of 1, one of these two terms will be very large, 
depending on whether fraction is greater than 1 or smaller than 1, which would 
cause the sum to be very large in either case. If rapid increase of Lagrangian 
generator will "leap" into rapid increase of Lagrangian itself, interference be- 
tween Lagrangians of nearby configurations will lead to probability amplitude 
being very close to 0. 

However, it is still okay for the Lagrangian generator to increase very 
fast, as long as it has no impact on Lagrangian. Since £ = min max JC, 
there has to be at least one Alexandrov set inside of which max /C is small. 
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Thus, around each point there has to be at least one Alexandrov set in which 
the Lorentzian equation for the distance nearly holds, but it is not true for 
arbitrary Alexandrov set. 

This, in fact, matches the observations for Lorentzian manifold. For exam- 
ple, suppose an electron is sent into a black hole with near-lightcone velocity. 
Then the proper time between emission of electron and electron reaching the 
center of black hole is very small. But it is not true that the flat space ge- 
ometry is approximately valid inside of the Alexandrov set defined in terms of 
these two events. However, it is still true that if electron was sent somewhere 
else, its velocity can, indeed, be adjusted in such a way that metric is, indeed, 
flat in its frame. This means that the statement of approximate local validity 
of Lorentzian geometry is true "for at least one frame" rather than "for every 
single frame" . This is exactly what Lagrangian generator is telling us. 

It should be noticed, however, that even if pa (l)~b^{p, (l)~c^{p, q)~ 
(PiPj q) is enforced, it does not imply a manifoldlike structure. In fact, for 
every single causal set, including the one with a lot of posts, it is possible to 
choose a, b, c and d pair-wise in such a way that the approximation holds 
for every single pair of points. What distinguishes manifoldlike causal set is 
that the number of such choices of a, b, c and d for manifoldlike causal sets is 
considerably larger than it is for non-manifoldlike ones. Thus, the key element 
of the theory is that larger number of similar choices of a, b, c and d implies 
larger probability. 
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This, indeed, was done in constraint 1 of section 2 of this appendix 

\Z? > f{\Z\') , (A.IO) 

where f{x) < 1 is the decimal expression for x starting from the 100-th digit. 

That constraint, effectively, replaces the notion of probability with a no- 
tion of "allowing" or "forbidding" histories. Every "allowed" history is repre- 
sented in exactly one parallel universe, regardless of its probability; however, 
the "density" of allowed histories approximately correlates with probability. 

Once all the fields are discretized according to constraint 2, there will 
appear a close correlation between the number of closely matching histories 
and the so-called probability. Consequently, if there is some other reason 
for the number of histories to be larger, it will imply larger probability. In 
particular, if causal relation -< happens to be manifoldlike, there will be larger 
number of choices of a, 6, c and d that are NOT ruUed out by the above 
constraint. Since every "allowed" choice is represented in exactly one parallel 
universe, this means that there will be a lot more parallel universes to represent 
manifold-like causal relation then there is to represent non-manifoldlike one, 
which is why randomly selected causal set is manifold-like with large enough 
probability. 
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A. 6 Appendix: Dyson-Based Model of Gravity for Con- 
tinuum Manifold 

In the first section of this Appendix it was stated that in order to have a topo- 
logical background for propagators, geometry, being identified as gravitational 
field, were to undergo quantum fluctuations, there has to be a mechanism to 
"collapse" fluctuating geometry into some fixed one in order to have a topo- 
logical background needed to propagate other fields. 

However, it was also mentioned that there is an alternative: a possibility 
suggested by Dyson that gravitational field does not exist altogether, which 
means that the geometry is not subject to quantum fiuctuations to start with. 
This possibility was put aside on the basis that Einstein's equation is a con- 
sequence of variational principle which, in turn, is a classical limit of path 
integration. 

Nevertheless, it was acknowledged that it is possible to avoid any reference 
to path integration by viewing Einstein's equation as a postulate as opposed 
to a consequence of variational principle. In this section, this alternative path 
will be taken up. 

One of the attractive features of this path is that by avoiding quantizing 
gravitation one can avoid dealing with non-renormalizeable theory. Further- 
more, this might allow one to postulate gravity in a usual continuum. The only 
reason to discritize space-time at all is renormalization of non-gravitational 
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fields which is something we are aheady used to from fiat space quantum field 
theory. 

It should be understood that this is a competing model with the one 
proposed in the rest of the thesis which means that this section can be skipped 
without compromising the understanding of other parts. 

In Section 2 of this Appendix it was found that laws of physics can be 
replaced with a set of all possible universes which are "constrained" in some 
way. This means that an additional constraint can be imposed while staying 
perfectly consistent with philosophy of the theory, namely the one that Ein- 
stein's equation is approximately satisfied. Of course, it can not be exactly 
satisfied since Bianchi identity would then demand a conservation of energy 
momentum tensor. But an approximate constraint can still be imposed. Thus, 
there are two constraints: 

CONSTRAINT: Let f{x) be a value between and 1 corresponding to the 
decimal expression of x starting from 100-th digit. A history {gn^{x), 4>o{x)) is 
allowed if and only if the following is true 

1) \Z{(f) = (f)o^,gf,^.)\'^ > f{\Z{(f) = (f)o;g^,u)\^), where Z{(f) = 0o;g^,.) is given 

by 

Z(<f>{x) = M^)) = j M exp ( j d^xyf^{iC{ct>-x) - k{ct>{x) - ct>o{x)f)) 

(A.ll) 

2) Einstein's equation approximately holds, where by "approximately" it is 
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meant 




(A.12) 



where 



(A.13) 



It should be noted that the above assumes that the values of T^i, has been 
localized throughout spacetime. This, of course, requires quantum corridors. 
However, while non-gravitational fields are still subject to quantum corridors, 
gravity itself no longer is. 

Since the former is renormalizeable while the latter isn't, this allows to 
compute the behavior of non-gravitational fields by standard methods and then 
apply the above constraints to estimate gravity without having to compute any 
graviton propagators. Then, based on estimated gravity, adjust the estimation 
of non-gravitational propagators, and based on that adjust the estimation of 
gravity, etc. In every case, the propagators are computted assuming aforegiven 
gravitational background. This allows to make predictions without dealing 
with non-renormalizeable theories. 
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